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Abstract 



In the literature, language behavior is considered rule-governed. It 
is therefore appropriate to carry out linguistic analysis in terms of rules. 
However, in dialect comparisons, listing of shared rules simply reiterates 
dialect features and does not seem to produce a synthetic picture of 
similarity or difference. To achieve such an overall measurement, we need 
to study dialects in terms of "rule-effect". Some rules affect a large amount 
of linguistic entities such as words, while others worm their way through a 
very small portion of the lexicon. Quantitative information of this sort is 
the basis for measuring rule effects. Specifically, two effect-comparison 
models are presented. One is the measurement of dialect similarity, and the 
other is the calculation of mutual intelligibility. The similarity model mainly 
tabulates the ratio of shared items over all items of concern. The mutual 
intelligibility model incorporates a weighting hierarchy that takes into 
consideration communication signal enhancement' and noise interference. 
The requirements of adequate database, the theoretical constructs involved, 
and possible pitfalls in the overall measurements are discussed. The goal of 
this research is to provide a principled way to present rule effects. 
Language typology may be studied in terms of the numerical measurements 
these effect-theoretic models produce. Thus linguistics can provide 
description of language phenomena, explain them with rules, and give a 
synthetic account of rule effects. 



1. Rules and Rule Effects 



We conventionally analyze language change in terms of rules. For example, 
and 1,1? belonged respectively to -m ending and -n ending rimes in Middle Chinese but are 
now homophones in Beijing dialect. The phonological merger of this pair and of other 
words warrants the following ixile with respect to the syllable ending: 

( I ) -m > -n 
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This rule can account for the change. And in the past we were satisfied with such 
a representation of language evolution. In fact, rules have been the main means for 
describing dialectal differences and for establishing dialect classification. For example. 
Ting (1982) lists 16 criteria for classification as variously discussed in Li (1937), Forrest 
(1948), Tung (1953), Yuan (1960) and Zhan (1981). As presented by Ting, most of the 
criteria relate to historical rules. These publications span half a century. They show the 
dominance and persistence of the concept of rules. Only a small number of the criteria 
concern dialect characteristics. We recast Ting's listing of those regarding initials as 
follows, adding "characteristics ' to highlight the criteria that are not strictly derivation 
oriented: 

(2) a. Change of Middle Chinese voiced stop initials 

b. Change of Middle Chinese bilabial stops 

c. Merger of f- and xu- — characteristics 

d. Change of Middle Chinese ^ntlS('® initials 

e. Mergei jf n- and 1- ~ characteristics 

f Change of Middle Chinese K^^^SW initials 

g. Palatalization of Middle Chinese velar initials 

h. Loss of nasality of Middle Chinese nasal initials 

i. Presence of voiced sibilants — characteristics 

Of these nine criteria, three pertain to dialect characteristics. Of course, these 
characteristics can be attributed to some historical rules. For example, in Wuhan and 
Chengdu the Middle Chinese n- and 1- initials changed to an alveolar nasal that has a 
variant 1-, hence the merger or free variation of n- and 1- (Beijing University 1962, 1989, 
henceforth the Hanyii Fatigyin Zihui). Thus this characteristics criterion reflects the 
effects of the change of these initials. However, in the past our predominant interest in 
analytical mechanisms such as rule format, rule ordering, and rule interaction somehow 
blurred the picture of the language as an integral living thing. Rule effects have not been 
the focus of linguistic inquiries. As we reviewed the historical rules implicit or explicit in 
the criteria above and in Chen (1976), Hashimoto (1979) and Tsai-fa Cheng (1985), we 
became more curious about what could be said regarding the consequences of those 
changes. 

Let us return to the -m and -n merger. Historically that was what happened to 
Beijing and most other Northern Dialects. What effects can we describe? Naturally, the 
most obvious effect of this rule is that there are no more syllables ending in the bilabial 
nasal That is one statement we can make to describe the dialect of Beijing. A more 
significant statement is to present the general phonological constraint that the bilabial nasal 
cannot occur in syllable-coda position. This constraint will then explain, for example, why 
the -m syllable-ending of loan ."ords from another diale'-* or from transliteration of other 
languages has to be substituted by an alveolar or velar nasal. 



Indeed, linguists have talked about constraints as language characteristics. Other 
characteristics such as word order, ergativity, case marking, and relative clauses have been 
the bases for making typological statements (Comrie 1989). In addition to those 
properties listed in (2), presence or absence of closed syllables, number of tones, etc. have 
been utilized to establish dialect classification. From the -m and -n merger we see that 
language characteristics may be introduced or changed by rules. However, characteristics 
are not the only type of effects we are interested in studying. As we look beyond the rules 
of a single language, cross-language comparisons in lerms of rule effects can be discussed 
in several ways. Traditionally, typological studies investigate language differences and 
make statements on presence or absence of certain characteristics. Recently, we have 
ventured into the area of quantitative measurements, hoping to answer some of the most 
frequently asked questions concerning Chinese: How different are Chinese dialects? Are 
they mutually intelligible? Our conventional answers to these questions usually reiterate 
the classification criteria. For example, we would say that in Northern Dialects the Middle 
Chinese -m ending has merged with -n while in the dialects in the south -m remains. That 
shows dialectal differences, indeed. But does that merger make northern dialects 
unintelligible to southern dialect speakers? Does that rule make northern and southern 
dialects very different? What is the degree of difference? Our quantitative studies of 
Chinese dialects have attempted to establish some methods for numerical measurements of 
dialect similarity and mutual intelligibility (Cheng 1982, 1992, 1994). We hope to be able 
to discuss rule effects quantitatively and therefore more definitively. We feel that the 
following are interesting and fertile areas to focus our attention and to raise new sets of 
linguistic questions. 

(3) Rule effects on dialect 

a. characteristics — qualitative-quantitative statements 

b. similarity ~ quantitative measurements 

c. mutual intelligibility — quantitative measurements 

Typological studies of various languages in the past have provided abundant 
examples of how to compare language characteristics qualitatively. We have indicated 
above how rules change language constraints and other characteristics. Therefore we 
propose that both rules and rule effects be stated to give a fuller picture of a comparison. 
In (3) we label this comparison as a type of qualitative-quantitative investigation. In 
reality, quantitative information is often implicitly used. For example, to say that in 
Beijing the syllable coda cannot be -m means that all the syllables ending in -m earlier in 
history have been changed to end in -n. "All", "none", and "some" are quantifiers. Thus 
judgments on language characteristics often take quantity into consideration. This type of 
quantification, however, has been used inconsistently, with varying degrees of precision 
and verifiability. The discussion above has shown that rule effects should be an important 
part of a rule analysis. In the remainder of this paper we will examine the possibilities of 
quantifying rule effects on dialect similarity and mutual intelligibility. 
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2. Rule Effects on Dialect Similarity 



We will examine the effects of the rule contained in criterion (la) pertaining to the 
historical change of voiced stop initials. It is well known that the voiced stops have 
remained in Wu and some Xiang dialects as voiced but have become devoiced in other 
dialects. To see the effects of devoicing, we will specifically study Beijing, a Northern 
Dialect, and Suzhou, a Wu variety. 

To see the extent the change of voiced stops affects the similarity between Beijing 
and Suzhou, we need to define the carriers of the change. A phonological change such as 
this one is carried by words. In Chinese a word is normally coterminous with a syllable. 
Thus we may use "word", "syllable", and "lexical item" interchangeably. We have 
tabulated the occurrences of items in relation to these Middle Chinese initials. In the 
following listing the first consonant in each line is for Middle Chinese. The modem 
reflexes are given after the colon. The number of items affected are given under Beijing 
and Suzhou separately. The database is the DOC (Dictionary on Computer) file that has 
been partially updated according to the second edition of the Hanyu Fangyin Zihui. 









Beijing 


Suzhou 


a. 


b 


b 




82 


b. 


b. 


P 


38 




c. 


b 




39 




d. 


b 


f 


2 




e. 


d 


d 




107 


f 


d 


t 


56 


1 


g 


d 


t" 


53 


2 


h. 


g 


8 




6 


i. 


g 


■k 




55 


j- 


g 


tc 


27 


2 


k. 


g 




29 




1. 


g 


k 


3 




m. 


g 




2 




n. 


Total 


249 


255 



First, the two total numbers are different because alternative readings for individual 
words are included. The devoicing in Beijing actually involves two features. The voiced 
stops became voiceless aspirated in Even tone and voiceless unaspirated in Oblique tones. 
All the items involved are uniformly voiceless and thus the rule has created the 
phonotactic that there are no voiced stops in modem Beijing. The aspiration part of the 
change is not without exception, for example, f$ and ^, which were in an Oblique tone 
and are now pronounced with an aspirated alveolar stop. Furthermore, in both Beijing 
and Suzhou, palatalization of the velar initials has occurred. The f- initial items are the 
two readings of the word iiyV in low tone and in high falling tone. 
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The five items pronounced with voiceless initials in Suzhou are shared by Beijing. 
However, they perhaps should be excluded from this table. In Suzhou the item 
pronounced with a voiceless unaspirated alveolar stop and both with a voiceless 
aspirated alveolar stop might not have been derived fi-om the voiced alveolar stop. The 
words jg and ^ both with a voiceless palatal affricate appeared in the first edition of the 
Hanyii Fangyin Zihui but have been deleted in the second edition. If we omit these five 
items from consideration, then regarding the evolution of the Middle Chinese voiced stops, 
Beijing and Suzhou do not share any items. They are entirely dissimilar. 

The total dissimilarity here is obvious by inspection of the disjunctive occurrence 
of the items. But when the items involved are numerous and the occurrence patterns are 
complex, we need to have a formula to calculate the degree of similarity. Similarity 
measurements are mostly based on the ratio of shared items to the total number of items 
considered. The "total number of items considered" is somewhat tricky. As we compare 
several dialects, say dialects A, B, C, and D at the same time, the items that occur in 
dialed C or D but do not occur in dialects A and B, might be counted in the total number 
of items considered when we compare dialects A and B. Ma (1989), Tu and Cheng 
(1991), Wang and Shen (1992), and Tu (1994) have reviewed various correlation methods 
and have pointed out such inflation of coefficient values in Cheng (1982). Now it seems 
that we have come to favor Jaccard's coefficient, which excludes the non-occurring items 
in the computation; 

(5) a/(a + b + c) 

where a: number of items shared by both dialects 

b: number of items occurring in one dialect only 

c: number of items occurring in the other dialect only. 

The calculation of similarity based on the numbers given in (4) for the effects of 
the change of the Middle Chinese voiced stops is either (6a) if we exclude or (6b) if we 
include those five items: 

(6) a. 0 / (0 + 244 + 250) = 0 

b. 5/(5 +244 + 250) = 0.010 

By definition, this similarity index ranges fi-om 0 to 1 . Thus the historical devoicing rule 
contributes no or extremely small value of similarity between Beijing and Suzhou. An 
overall similarity comparison will have to take consider more cases. In Cheng (1991) 
3,373 cases of initials, finals, and tones were used to calculate a phonological similarity 
matrix for 17 Chinese dialects. In the literature, presentation of the number of instances 
attesting to historical correspondences is quite common. For example, just to be critical of 
ourselves, Cheng and Wang (1971), Chen (1976), and Wang and Cheng (1987) have 
extensive lists of numbers variously showing correspondences for initials, finals, and tones 
between Middle Chinese and modern dialects. But those numbers are simply numbers of 
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instances; no principled ways of synthesis are given in those studies. Here in this paper we 
are using devoicing as an example to show how to quantify rule effects. 

The use of the DOC database for quantification of similarity, affinity, and mutual 
intelligibility deserves some comments. The items in the database were collected from the 
first edition of the Hanyu Fangyin Zihui and partially updated on the basis of the second 
edition. T'he Hanyu Fangyin Zihui contains phonological information for over 27,00 
common words. Those words were not selected according to some sampling principles. 
Consequently one could question the validity of the data as a fair sample for prediction of 
the nature of the dialects. Selection of linguif" ; data for quantification has always been a 
substantive as well as methodological issue. The "basic" lexicon in glottochronology 
limiting the size to a couple hundred items would not be a good representative for our 
purposes. We maintain that the larger lexicon of the Hanyu Fangyin Zihui would allow us 
to make various sorts of inquiries. Much as we wish to claim the predictive power of our 
quantification, a moderate view of taking this research as a population study of this 
particular collection of data would help us jump over the hurdle of statistical sampling and 
allow us to venture into different modes of linguistic inquiry. 

3. Rule EfTects on Dialect Mutual Intelligibility 

Since Middle Chinese, historical rules have changed linguistic entities and patterns 
of the speech in various reg'ons. Deterioration and enhancement of dialect mutual 
intelligibility are the most obvious effects of the rules such as those given earlier in (2). 
Mutual intelligibility, in spite of its vague definition in the past, has been used as a criterion 
for language subgrouping by linguists. Social scientists and non-professionals often 
demand a definitive answer from linguists to their questions about mutual intelligibility. 
We have poked around for years; it is time for us to try to answer this challenge. The 
motivation, weight assignments of characteristics, and procedural details for calculating 
mutual intelligibility have been presented in Cheng (1992, 1994). In essence, we take the 
view that human pattern cognition is based on observation of repeated phenomena. That 
is, repetition lends its weight of numbers to pattern formation. In dialect communication, 
the recurrence of corresponding elements on the basis of cognates, such as Beijing initial 
p- to Jinan p- and Beijing n- to Jinan 1-, forms correspondence patterns. Some 
correspondence patterns involve many members such as words while others contain only a 
small number of entities. It is therefore useful to divide patterns into major and minor 
ones. Major patterns give a sense of regularity and therefore are considered as 
communication enhancing signal. On the other hand, minor patterns are exceptions and 
can act as interfering noise. 

Intuitively we feel that between a pair of dialects, say A and B, the intelligibility of 
dialect B for dialect A may not be identical to that of dialect A for dialect B. Hence we 
use the term "source dialect" and "target dialect" to refer to the way corresponding 
patterns are established. First we set up the patterns according to the elements in dialect 
A. We then calculate the one-way unidirectional intelligibility value. Then we collect the 
correspondence patterns according to the elements in dialect B and calculate the 
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intelligibility. This differentiation thus recognizes the needs to derive unidirectional 
intelligibility as the first step of the calculation of mutual intelligibility. We take the mean 
of the two unidirectional intelligibility degrees to be the mutual intelligibility of the two 
dialects. A crucial issue of the calculation is the determination of importance or weight of 
various correspondence patterns. We have established a weight scale in Cheng (1992, 
1994). The scale using a unitary 1 as the full value takes into account the type of 
correspondence patterns (signal or noise) and the nature of the corresponding items (same 
or different). When the dialects have the same items in a major pattern, the intelligibility is 
the highest, for example, Beijing p- corresponding to Jinan p-. If the target-dialect 
element is different from that of the source dialect and that element occurs elsewhere in 
non-cognate items in the source dialect, then the coiifusability is the highest. For example, 
the correspondence of Jinan 1- to Beijing n- involving a single item might allow Beijing to 
wrongly take that item as an item in Beijing 1-. Other situations obtain more moderate 
values. The weight scale is as follows: 

(7) Signal Noise 

For each item in a pattern, the target-dialect 

a. element is the same as that of the source dialect: 1.00 -0.25 

b. element is different from that of the source dialect 

i. and does not occur in the source dialect: 0.50 -0.50 

ii. and occurs elsewhere in the source dialect: 0.25 -1.00 



In (2) we list the dialect subgrouping criteria concerning initials. In order to 
discuss some of them to show rule effects, we need to give proper weight to initial 
consonants. Since we use cognate syllable-words to establish correspondence patterns, 
we may assign one-fifth of the unitary value 1 to each of the five traditional segments of 
initial, medial, nuclear vowel, ending, and tone. This weight scale for initials is the 
following: 



(8) Signal Noise 
For each item in a pattern, the target-dialect's phonological 

a. element is the same as that of the source dialect: 0.20 -0.05 

b. element is different from that of the source dialect 

i. and does not occur in the source dialect: 0.10 -0.10 

ii. and occurs elsewhere in the source dialect: 0.05 -0.20 

Now let us examine how the devoicing rule affects the mutual intelligibility 
between Beijing and Suzhou as an example to show rule effects. First, patterns of sound 
correspondence on the basis of cognate words have to be established. We will use Beijing 
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as the source dialect and Suzhou as the target dialect to inspect one-way intelligibility. 
Let us start with the bilabial. The Middle Chinese voiced bilabial stop has changed into 
voiceless unaspirated stops in Oblique tones. This is part of the generalization we made 
earlier concerning the devoicing rule in Beijing. However, the p- initial from Middle 
Chinese p- has remained unchanged in both Beijing and Suzhou. The Beijing p- therefore 
corresponds to Suzhou b- and p-. Thus, one effect of the devoicing is the merger of the 
earlier p-b distinction within Beijing and the creation of p:b and p:p correspondence 
patterns between Beijing and Suzhou. Following are the patterns showing the voiceless- 
voiced correspondence containing 41 items and the voiceless-voiceless correspondence 
having 93 items of cognate words in our DOC database. 

(9) a. p : b 41 
b. p : p 93 

From the point of view of Beijing p-, the correspondence patterns have a mean of 
67 ((41 + 93) 12). The p:b pattern with 41 items is less than the mean. Therefore this is a 
minor pattern and is considered noise, interfering with intelligibility. Since the initial b- 
does not appear in Beijing, the unit value is -0.10 according to the weight scale. The 41 
items yield a value of -4.10. The p.p pattern with identical initials has a frequency greater 
than the mean, and therefore it is considered communication enhancing signal. Its unit 
weight is 0.20 and its value is 93*0.20=18.60. So regarding Beijing p-, one pattern 
contributes negatively and the other positively to intelligibility. To show the cumulative 
effects, we add "noise" and "signal" columns below. The values in these two columns will 
accumulate as we proceed to show other patterns. At the end of the calculation, we will 
see the cumulative values as the numerical effects of the devoicing rule. 

(^0) frequency mean weight value noise signal 

a. p ; b 41 67.0 -0.10 -4.10 -4.10 

b. p : p 93 67.0 0.20 18.60 18.60 

As said before, the devoicing for words in Even tone has produced voiceless 
aspirated stops. We need to inspect the Beijing-Suzhou correspondence patterns for 
Beijing p^-. The frequency, mean, weight, value, and cumulative sums of noise and signal 
in that order are given below: 



(•U frequency mean weight value noise signal 

b 44 30.6 0.10 4.40 23.00 

p 4 30.6 -0.20 -0.80 -4.90 

44 30.6 0.20 8.80 31.80 



a. ^ 
b. 



c. 



This time the weight for the pattern is 0. 10 because its occurrence is greater than the 
mean but the initials are different. The 4 items of the p^:p correspondence are irregular in 
two respects. First the irregularity may be due to the informant's idiosyncratic speech 
For example, nn, one of the 4 items, was given with p- in Suzhou in the first edition of the 
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Hanyu Fangyin Zihui and with pf- in the second edition. Another cause of this odd 
correspondence might be due to irregular change. For example the word |^ somehow 
acquired aspiration in many dialects except Suzhou and Wenzhou. Fortunately the 
frequency is too small to skew the figure drastically. The assignment of weight and the 
derivation of the value for the f.^' pattern are straightforward. The noise and signal values 
are cumulative from those given in ( 1 0). 

Earlier we showed :hat t^. had two readings with f- in Beijing. It has a b- initial in 
Suzhou. In Suzhou the initial of the word Wi- somehow has changed from a. Middle 
Chinese fricative to b-. Thus we found Beijing f- corresponds to Suzhou b- with 3 items. 
The correspondence patterns for Beijing f- and their numerical values are give below: 

(12) frequency mean weight value noise signal 

a. f : b 3 32.6 -0.10 -0.30 -5.20 

b. f : f 57 32.6 0.20 11.40 43.20 

c. f : V 38 32.6 0.10 3.80 47.00 

The avid reader will question the inconsistency of the numbers given so far. In (4) 
we show that Suzhou has 82 items with b- initials derived from Middle Chinese b-. And 
yet here we show 88 (41 + 44 + 3) items with b- in Suzhcu. The difference is due to the 
fact that we are looking at slightly different matters here. In (4) we give the historical 
origin. In (10), (1 1), and (12) we show the modem b- initials irrespective of their history. 
The occurrence of 88, then, means that the initials of six words have changed from 
voiceless to voiced. As an example, one of these words is 'j^, which had the voiceless 
unaspirated bilabial stop in Middle Chinese. 

So far we have accumulated 47.00 positive and -5.20 negative points for the 
effects of the devoicing rule. The numbers have to be interpreted in the context of all the 
phonological correspondence patterns between Beijing and Suzhou. To anticipate the 
results, we should say that there were 2,916 syllable-words in the database in the Beijing- 
Suzhou case. Since in the weight scale we assign 1 unit value to each syllable, by 
definition the maximum sum of noise and signal is 2,916 in this case. We may call the sum 
value of signal enhancement and noise deduction the "signal-noise value". The signal- 
noise value will be less than the maximum, unless the two dialects are identical in every 
respect, in which case the value would be the same as the number of syllables. The 
normalized unidirectional intelligibility index is obtained by dividing the cumulative signal- 
noise value by the total number of elements involved, in this case the elements being 
syllables. 

Up to this point, the signal-noise value is 41.8 (47 - 5.20). To continue the 
investigation of the effects of the devoicing rule, we need to examine the signal-noise 
values of all the patterns that involve the modern initials as listed under Beijing in (4). 
Besides p, f and f, whose correspondence patterns have been tabulated above, we will 
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quantify the signal-noise values of the patterns for Beijing initials t, t^, k, k*, tc, and Kf below 
without comments on exception or irregular items. 



(13) 








frequency 


mean 


weight 


value 


noise 


signal 




a. 


* 

I 


a 


JO 


40. 0 


U. lU 


^ 8A 

J.oV 




^9 fiA 




b. 


t 


t 


O 1 

81 


46.6 


0.20 


1 ^ or\ 

16.20 




/Cr> AA 

6V.00 




c. 


t 




1 


46.6 


-0.20 


A OA 

-0.20 


-5.40 




(14) 








frequency 


mean 


weight 


value 


noise 


signal 




a. 


: 


d 


53 


59.0 


-0.10 


-5.30 


-10.70 






D. 


r 


I 


OJ 




U.2u 


1 1 AA 
13.UU 




QO AA 

o2.UU 


(15) 
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value 


noise 


signal 




a. 


k . 




2 


34.5 


-0.10 


-0.20 


-10.90 






b. 


k : 


g 


4 


34.5 


-0.10 


-0.40 


11 ^ /\ 

-1 1.30 






c. 


k : 


k 


129 


34.5 


0.20 


25.80 




107.80 




d. 


k : 


tc 


3 


34.5 


-0.20 


-0.60 


-11.90 
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frequency 


mean 


weight 


value 


noise 


signal 




a. 




g 


4 


15.2 


-0.10 


-0.40 


-12.30 






b. 


li. 

IC 


h 


1 


ICO 

15.2 


-0.10 


-0. 10 


1 O A /\ 

-12.40 






c. 


ic 


I, 

K 


i 


ICO 

1 5.2 


r\ or\ 
-0.20 


A A.r\ 
-U.60 


1 O AA 
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1/* 
It 


IC 


O / 


ICO 


f\ OA 

U.2U 


13.40 




1 O 1 OA 

121 .20 




e. 


1/* 
IC 


to 


1 
1 


1 c o 
15.2 


A OA 

-0.20 


A OA 

-U.2U 


1 1 OA 

-13.21) 
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tc 


G 


i 


33.3 


-0.20 


-0.20 


-13.40 






u. 


Iv 






11 1 


n 1 r> 


9 ^A 


1 ^ OA 






c. 


Iv 


u 
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1 
1 


11 1 


ATA 
-U. lU 


ATA 
-U. lU 


1 ^ AA 

- lo.uu 






A 
U. 




L- 
K. 


HZ 


11 1 
33.3 


A A^ 

U.U J 


9 1 A 




1 91 1A 
1 Z3 .3U 




e. 


u 


u 


1 


11 1 

J3.3 


A OA 
U.ZU 


9Q 9A 




1 ^9 <\A 






Iv. 


tc 
a 


17 

J / 


11 1 
33.3 


A A^ 

U.U J 


1 8^ 

1 .OJ 




1 ^A 1^ 




CI 

8- 




ic'' 


1 
1 


11 1 
33.3 


A 9A 
-U.ZU 


A 9A 


1 fi, 9A 






u 

n. 




Z 


1 A 
1 H 


11 1 
33.3 


A 1 A 
-U. 1 U 


- 1 .HU 


1 7 AA 
- 1 / .OU 




(18) 
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a. 


It'' 


: dz 


28 


16.2 


0.10 


2.80 




157.15 




b. 


!c' 


: g 


1 


16.2 


-0.10 


-0.10 


-17.70 






c. 




: i^- 


7 


16.2 


-0.20 


-1.40 


-19.10 






d. 




: tc 


3 


16.2 


-0.20 


-0.60 


-19.70 






e. 


to' 


; to' 


34 


16.2 


0.20 


6.80 




163.95 




f 


to' 


: 


28 


16.2 


0.05 


1.40 




16*^.35 




g 


to' 


z 


13 


16.2 


-0.10 


-1.30 


-21.00 
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At this point we have completed the tabulation of the correspondence patterns of 
the initials involved in the devoicing rule. Thus the positive contribution of the devoicing 
rule is 165.35 and its negative contribution is -21.00. The signal-noise value is therefore 
144.35 (165.35-21.00). As said before, the numbers have to be interpreted in the context 
of all the phonological correspondence patterns between Beijing and Suzhou. There were 
2,916 syllable-words in the database, and by definition the maximum sum of the signal- 
noise value is 2,916. We have tabulated all the phonological correspondence patterns 
including the ones given above. The sum of positive and negative values is 1486.35. The 
one-way intelligibility index for Beijing-Suzhou is 0.510 (1486.35 / 2916). Out of 
1486.35 the devoicing rule contributes 144.35. This value is about 10% (144.35 / 
1486.35 = 0.097) of the total. 

4. Numerical Measurement as Synthesis 

With our similarity measurements, the devoicing rule makes Beijing and Suzhou 
entirely dissimilar. In terms of mutual intelligibility, the effects participate in contributing 
about 10% of the communication enhancing value. This seeming contradiction arises 
because when we talk about correspondence patterns we include those elements that were 
produced by the rule as well as those outside of its application. Moreover, our mutual 
intelligibility measurement does not require identical corresponding elements to enhance 
communication. As long as a pattern has more than the mean number of elements, it 
becomes a regular correspondence. A regular correspondence pattern enhances 
communication. That is how we can understand other dialects and speech with a foreign 
accent. The t:d pattern given above, in spite of its differing elements, is considered signal 
and contributes a positive value of 5.8. The same positive effect can be said of the f.h 
correspondence. On the other hand the p;b and t^:d patterns contribute negatively as 
expected. The merger of initials has such complex consequences. We think numerical 
measurement is a way to represent the synthesis of various forces upon the dialects. 

What does it mean to say that the Beijing-Suzhou unidirectional intelligibility is 
0.510? Does it mean that half of the speech of Suzhou dialect can be understood by 
people from Beijing? As discussed in Cheng (1992, 1994), personal understanding of 
other dialects involves factors such as language background, expeiience in non-native 
environment, individual ability, etc. We call such mutual intelligibility "subjective mutual 
intelligibility". Here in this study we are looking at dialects as systems. The mutual 
intelligibility so calculated can be called "systemic mutual intelligibility". As speakers of a 
dialect, in spite of personal differences, are confined by or endowed with the dialect 
system, the calculation of subjective mutual intelligibility has to be based on systemic 
mutual intelligibility. However, we do not know exactly how to calculate subjective 
mutual intelligibility yet. To say that we have yet to crystalize measurement ideas would 
sound odd. But that is true at this initial stage of the quest for a quantitative synthesis of 
language similarity and mutual intelligibility. 

For now, we will show in the Appendix the dialect subgrouping based on the 
calculated mutual intelligibility indices for all the 17 dialects represented in the Hanytt 



Fangyin Zihui so that the Beijing- Suzhou values can be compared to the overall picture. 
The details of the procedure for arriving at the subgrouping can be found in Cheng (1992, 
1994). Of relevance here is that the Beijing-Suzhou unidirectional intelligibility is 0.510 
with Beijing as the source dialect and 0.489 with Suzhou as the source dialect. The 
mutual intelligibility is the mean value of 0.499. This value can be compared with the 
highest intelligibility pair of Hankou and Chengdu, having the mutual intelligibility of 
0.795. The lowest intelligibility pair, Shuangfeng-Chaozhou, has a value of 0.353. 

The numerical exercise above illustrates that rules are not isolated events. They 
may cause complex interactions in elements, which our conventional rule format cannot 
specify. Interests in rule effects have led us to relate rules to dialect similarity and mutual 
intelligibility. Are we rewriting rules in linguistics? That is for the reader to judge. 
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1. Introduction 

Probably most linguists would agree, in the abstract, that any approach to language 
analysis must pay due attention both to the similarities and to the differences among languages. 
However, when it comes to implementing this policy, we find substantial differences between 
the practice of different linguists, corresponding not only to differences among different 
schools, but also to differences within the same school at different times, and even to differ- 
ences among individual linguists within the same school at the same time. At the one extreme, 
one might believe that all languages are essentially the same, with differences among lan- 
guages, while of course there, not being particularly significant. The linguist who follows this 
line of argument will typically assume tiiat whatever categories have proved necessary or useful 
in describing language A must also be present in language B. If they are not immediately ap- 
parent in language B, then they must be present in some more covert form. If they do not prove 
necessary, or even useful, in the analysis of language B, then this is not taken as evidence that 
they do not exist in language B; rather, this state of affairs is interpreted as an indication that the 
categories are present in language B, but are simply not made use by the grammar of language 
B. At the opposite extreme, one might assume that no category that is not immediately apparent 
in the structure of language B is necessarily absent from language B, following the slogan "if it 
seems not to be there, then it isn't there". 

I believe that both of these extreme approaches are misguided. If we start out by assum- 
ing that the properties of language A are necessarily relevant for language B, and never ask 
ourselves what the justification for these properties might be internally to language B, then we 
can continue indefinitely holding to this belief, irrespective of whether or not this is the best 
analysis of language B. Thus if someone believes, on the basis of morphological case distinc- 
tions in Latin, that English has a covert distinction between dative (roughly, 'to') and ablative 
(roughly, 'from') cases, then I doubt if the English language will ever present him with any 
evidence that this analysis is wrong-headed. But there is a rather obvious alternative analysis 
that does much better justice to the facts of English, namely that English makes use of different 
prepositions, rather than different cases, to make this distinction. TOs is an important method- 
ological point. Many linguists hold to the view within philosophy of science that a particular 
analysis is tenable until disproven. Unfortunately, with analyses that have no empirical conse- 
quences — like the alleged covert dative-ablative opposition in English — even a rather silly 
analysis will never be disproven. I have deliberately chosen a "silly" analysis to make a clear 
point, although in what follows I will note some instances that are quite parallel although not so 
obviously wrong-headed. 

Let me give a more subtle example that may illustrate the same phenomenon, and which 
involves a contrast between English and Chinese.^ In English, there are good reasons for dis- 
tinguishing syntactically between the two kinds of adnominal clauses found in (1) and (2): 



'I originally developed this particular argument using the contrast between English and Japan- 
ese, although in this particular respect Chinese and Japanese seem to have essentially the same 
property, in contrast to English; a written version of the original argument is currently under 
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(1) the opinion that I put forward 

(2) the opinion that they should go to America to study medicine 

Sentence (1) is a relative clause, while (2) is a complement clause with a nominal head. The 
syntactic differences between the two constructions can easily be demonstrated. In (1), that can 
replaced by which, as part of the usual rule whereby a relative clause can be introduced by 
the relative pronoun who or which, depending on the animacy of the head noun, as in (3); this 
is not possible in (2), whence the ungrammaticality of (4): 

(3) the opinion which I put forward 

(4) *the opinion which they should go to America to study medicine 

If we compare the relative clause in (3) with the closest corresponding simple sentence, then 
we see that this simple sentence requires an noun phrase in the direct object position after the 
verb put forward (as in (5)); this in keeping with the analysis of both traditional and more re- 
cent grammatical studies whereby, depending on details of the analysis, the relative pronoun in 
(3) is, in some sense, the direct object of the verb put forward, or the relative clause contains a 
gap corresponding to the direct object position. 

(5) I put forward the opinion. 

In (4), by contrast, there is no gap; the subordinate clause, minus the conjunction, is a perfectly 
well-formed sentence in isolation; there is no possibility of inserting a noun phrase into this 
clause that would correspond to the diat of (2): 

(6) They should go to America to study medicine. 

What about the corresponding constructions in Chinese, as in (7)-(8)?^ 

(7) w6 tfchuMi-de yiji&n 

I put.forward-PTCL opinion 

(8) tamen qil MSigud xu^ yi-de yiji^ 
they go America study medicine-PTCL opinion 

In Chinese, there is no obvious difference between the two sentence types. In both cases, we 
have a head noun which is preceded by a clause, the two being linked by means of the particle 
de. Moreover, given that Qiinese has the phenomenon of null-anaphora (zero-anaphora), it is 
at least harder to justify an analysis whereby there is a gap in (7), contrasting with the absence 
of a gap in (8). Finally, the construction with a nominal head as (8) is not how clausal com- 
plementation with a verbal head is done in Chinese — the latter involves no particle, and the 
subordinate clause follows rather than precedes the main clause, as in (9); thus, Chinese lacks 



preparation for a publication of the European Science Foundation Programme in Language 
Typology (EUROTYP). Except where otherwise specified, I will use "Chinese" as an abbre- 
viation for "Mandarin Chinese", or rather, even more specifically, "Modem Standard (Man- 
darin) Chinese". 

^The following abbreviations are used: ABL — ablative, ACC — accusative, ASP — aspect, CL 
— classifier, DAT— dative, DO — direct object, GEN — genitive, LOC — locative, M — ^mascu- 
line, NOM — nominative, PASS — passive, PL — ^plural, PRS — present, PTCL — particle, SG — 
singular, TOP — topic. 
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the parallelism between the nominal-head construction and the verbal-head construction that is 
found in English. 

(9) Ta ti'yi w6men dou qCl chi jiSozi. 

he suggest we all go eat dumpling 
'He suggests that we all go and eat dumplings.' 

Nonetheless most recent discussions of such Chinese constructions with which I am famU- 
fa^^nd I L only lament the fact that, given my own limitations, this necessarily excludes 
Ssions h Chines^make the assumption that Chmese has distinc consmictions of rela- 
tive Sean^ complement clause with a nominal head.^ However, an alternative analysis is at 

feit Xsi'e ^^^^^^^^ - (7) and (8) one has the f^.^y-'^^l^ZT^^Snl 
a head noun with a modifying clause; what the speaker of Chinese then ^o^f 1 ° 
nlausible interoretation to this structure, which in the case of (7) leads to the jelative-ciause 
SemrS of (8) to the "complement clause" interpretation-4hus there is no 

So S'^y syntactic difference betweeJ the two types Of course, before adop mg th^ 
Ssisk would be necessary to test its consequences. But the point I wan to make is not to 
S *J eXr the "two-constructions" or the "one-construction" analysis is the correct anal- 
S for Aese examples in Chinese. Rather, I want to suggest that because a particular distmc- 
don is made in English, this distinction has been carried over mto the analysis of Chinese, 
S?ut s^ppTng S 'whether this distinction is really justifiable indeed without stopping to 
rrwheSe distinction as applied to Chinese may not actually lose a sigmficant generaliza- 
tion concerning the structure of that language. . »u * 

On the other hand, assuming that we can only justifiably make use of categones that are 
clearly overt in a particular language can equally lead to missed insights. For ^stance, m nwiy 
laSges in particular several older or more archaic European languages, one of the inos 
obvlouf criteria for grammatical relations, such as the distinction between subject direct 
S ' is Sholofical case. English for the most part lacks morphologica case However, 
Soes nS mean that English lacks grammatical relations, since as I wiU >1 ustm^^^^^^^J^^" 
2 English has a number of surprisingly robust criteria for identifying subjects; it just so hap- 
pens that morphological case plays a minor role in this identification. 

2. A Latin Grammarian Looks at Englisli 

In this section, I want to imagine the case of a Latin grammariai>-a linguist who is 
both a native speaker of Latin and a scholar of the Latin language-who turns to examming 
modern English This is, needless to say, a chronological impossibility but is nonetiieless m- 
Sghtful as I "thought experiment", especially insofar as it contrast the structures of two of the 

3Many readers will no doubt recognize these Chinese sentences as taken from Li and Thomp- 
son (1981: 586-587). Interestingly, although Li and Thompson m general do not adopt the 
v^w that the linguistic structure of Chinese has to be analyzed in the same way as tha of 
English, in their discussion of these examples they do insist on the distmction between the two 
kinds of construction in Chinese. However, the cntcria they give arc semantic, so their analysis 
is still consistent with the view that thsre is no syntactic difference between the two sentence- 
types. 

-♦in presenting arguments below, I will often use the terms "subject" and "direct object" not 
only in the strict sense where their identification has been jusified in the language or construc- 
tion at issue, but also more informally where my interest is in arguing whether or not these 
grammatical relations are justifiable, and where a more strict terminology might use putative 
subject" and "putative direct object" 
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most thoroughly investigated of the world's languages. Let us suppose that our Latin grammar- 
ian is particularly interested in grammatical relations. 

In Latin, an obvious criterion for subjects and direct objects, which works in the vast 
majority of instances, in particular in finite clauses, is case marking: Subjects stand m the 
nominative case, while direct objects stand in the accusative case, as in (10): 

(10) Marcus Titum ferit. 

Marcus.NOM Titus.ACC hit-PRS.3SG 
'Marcus hits Titus.' 

Indeed, the identification of grammatical relations by means of cases is so strong in Latin that it 
overrides other considerations, such as word order, so that (11) has the same basic meaning as 

(10) , while to express the idea of Titus hitting Marcus it is necessary to change the cases, again 
irrespective of word order: 

(11) Titum Marcus ferit. 
Titus.ACC Marcus.NOM hit.PRS.3SG 
'Marcus hits Titus.' 

(12) Titus Marcum ferit. 
Titus.NOM Marcus.ACC kill.PRS.3SG 
'Titus hits Marcus.' 

One of the first things that strikes our Latin grammarian is that in the English translation 
equivalents of these sentences, there are no morphological case differences between the noun 
phrases Marcus and Titus, whether they stand as translation equivalents of a Latin nominative 
or of a Latin accusative. What conclusions might our grammarian draw from this typological 
difference between the two languages? There are at least two initial reactions that he might 
have. The first is to conclude that English "really" does have the morphological distinction 
between the two cases that is found in Latin, and therefore the difference between the two 
grammatical relations of subject and direct object, but that this distinction in English is 
"covert". The second is to conclude that English lacks the most salient distinction found in 
Latin between subject and direct object, namely case marking, and therefore also lacks the dis- 
tinction between subject and direct object, if indeed it does not lack grammatical relations alto- 
gether. In section 1 1 mentioned briefly an example, that of adnominal clauses, v^here the lack 
of an opposition found in one language (English) but not in other (Chinese) has arguably led 
linguists to misanalyze Chinese by transferring the English distinction to Chinese. In the Latin- 
English comparative example, I will be arguing almost the inverse, namely that English does 
have the distinction between subject and direct object found in Latin, although its manifestation 
is somewhat different, and indeed some of the details of the assignment of grammatical rela- 
tions are also different between the two languages. Thus, it is necessary to examine each indi- 
vidual language carefully before concluding that distinctions valid in some other language are 
or are not valid in the language now under investigation. 

Let us therefore follow our Latin grammarian as he finds out more about English. The 
data are, of course, not new, but it is important for our own learning experience to imagine 
ourselves following in the footsteps of the Latin grammarian. The English data that follow arc 
introduced not in order to establish new generalizations about English, but rather to illustrate a 
methodological point, ^ 



^The relevant sections of Postal (1974) constitute a useful checklist of subject properties in 
English, although some of the properties alluded must necessarily be examined against the 
background of the author's theoretical commitment at the time and of the particular argument he 
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The Latin grammarian will soon notice that most personal pronouns in English, in con- 
trast to non-pronominal noun phrases, do show a distinction in form between subject and direct 
object, as in (13) and (14): 

(13) I hit him. 

(14) He hits me. 

The distinction is very similar to the case distinction found in Latin with the majority of noun 
phrases, although there are some differences, for instance in that the Latin accusative me 'me' 
contrasts with other non-nominative cases, such as dative mihi 'to me', whereas English only 
has the binary distraction / — me. But at least the case distinction does allow us to identify pro- 
nominal subjects unequivocally in terms of their morphological form. Our Latin grammarian 
will notice that in English, as in Latin, the grammatical relation of subject does not correspond 
to any single semantic role, so that generalizations made about subjects cannot be readily re- 
placed by statements about semantic roles; good evidence for this is provided by the nominative 
subject-patient m passive sentences like (15): 

(15) I am hit by him. 

The Latin grammarian, here even thinking on the basis of his own analysis of his native lan- 
guage, will recall that verb agreement is also a good criterion for subjects in Latin, at least in 
finite clauses, where verbs agree in person and number with their subject, and with no other 
noun phrase. (Latin non-finite verb forms do not show any person-number agreement.) Al- 
though English verb morphology is much reduced in comparison with that of Latin, where 
English does have distinct person-number forms of verbs, they show agreement with the sub- 
ject and only with the subject, including subjects expressing different semantic roles, including 
in particular subjects of passive sentences. Thus, in (14) the verb form hits unequivocally 
shows third person singular agreement, i.e. it agrees with the subject he and does not agree 
with the first person singular direct object. Likewise, in (15) the verb am clearly shows agree- 
ment with the subject-patient /. 

As our Latin grammarian becomes more familiar with English and its structure, he will, 
at least with some degree of perseverance and insight, uncover other, much more surprising 
criteria that enable one to identify subjects in English. For instance. Quantifier Float is the name 
given to the construction illustrated by (17) in relation to (16). In (17), as in (16), the quantifier 
both must refer to the boys, and cannot refer to the bikes; in other words, in both versions 
there is necessarily reference to two and only two boys, whereas the number of bikes is un- 
specified (except that, the noun phrase being plural, reference must be to more than one bike). 

(16) Both of the boys soon found the bikes. 

(17) The boys soon both found the bikes. 

The judgments of native speakers of English are quite unequivocal on such sentences, which 
thus provide a robust but nonetheless subtle, by no means obvious test for subjecthood— it is a 
test which, moreover, does not work for all languages, Chinese, for example, lacking this par- 
ticular constraint with quantifiers such as dou 'all'. As with the criteria mentioned earlier, we 
also observe that the possibility of Quantifier Float applies to subjects whatever their semantic 



was expounding. One of the main contributions of Relational Grammar, as illustrated for in- 
stance in Perhnutter (1983), Perknutter and Rosen (1984), and Postal and Joseph (1990), es- 
pecially methodologically, has been its emphasis on the importance of uncovering criteria for 
the establishment of grammatical relations. For further exemplification of my own work in this 
area, reference may be made to Comrie (1989), and references cited there. 
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role, including patient-subjects of passive clauses: In (18), in contrast to (17), it is the bikes 
that must number exactly two, whereas the number of boys is unspecified beyond 'more than 
one', i.e. (18) has the same basic meaning as (19): 

(18) The bikes were both found by the boys. 

(19) Both of the bikes were found by the boys. 

Another subtle example of this kind is provided by constructions like (20) in relation to (21), 
constructions that some linguists have called Subject-to-Object-Raising, although this term may 
seem to imply an analysis tfiat other linguists reject — I will use it here as a neutral label for the 
construction in question. 

(20) I believe him to have hit them. 

(21) I believe that he has hit them. 

The important point about the construction in (20) is that after certain main-clause verbs, in- 
cluding believe, it is possible to have a following noun phrase (in the non-nominative if pro- 
nominal) followed in turn by an infinitive, such that the non-nominative noun phrase corre- 
sponds to the subject of the equivalent that clause. Crucially, one cannot start from an example 
like (21) and create a sentence parallel to (20) in which a non-subject, say the direct object, 
shows up in this position, as is shown by Uie ungrammaticality of (22): 

(22) *I believe them he/him to have hit. 

Once again, the corresponding passive versions (of the subordinate clause) show that tlie rele- 
vant notion of subject is independent of semantic roles: 

(23) I believe them to have been hit by him. 

(24) I believe that they have been hit by him. 

Have we therefore reached the conclusion that grammatical relations are identical in 
English and Latin? Not quite. Although it is in general the case that a Latin subject will cone- 
spond to the subject of its English translation equivalent and vice versa, fhere are nonetheless 
some exceptions, and our Latin grammarian will need to note them. For instance, English al- 
lows all four of (25H27), with agent (in (25)), patient (in (26)), or recipient (in (27)) of the 
verb give as subject. Latin, by contrast, has translation equivalents with parallel grarnmatical 
relations only for the first two. 

(25a) The man gave the books to me. 

(25b) The man gave me the books. 

(26^ The books were given to nie by tlie man. 

(27) I was given the books by the man. 

(28) Vir mihi libros dedit. 
man.NOM LDAT book.PL.ACC give.PAST.3SG 
'The man gave tlie books to rac.' 
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(29) Libri mihi a viro dati sunt.^ 
book.PL.NOM I.DAT by man.ABL given.M.PL be.PRS.3PL 
'The books were given to me by the man.' 

(30) *Ego libros a viro datus sum. 

I.NOM book.PL.ACC by man.ABL given.M.SG be.PRS.lSG 

This last set of examples may raise a question in the reader's mind: If there is not exact corre- 
spondence between subjects in English and Latin translation equivalents, are we still justified in 
using the same term in referring to the relevant grammatical relations in these two languages? 
My view on this remains as expressed in Comrie (1989). In order to identify a particular gram- 
matical relation in a particular language, we must have grammatical criteria that are valid inter- 
nally to that language. In order to identify grammatical relations cross-linguistically, there must 
be substantial overlap between the occurrences of the same grammatical relation in translation 
equivalents across the two languages. As with other aspects of grammatical structure, we are 
unlikely to find exact identity across all translation equivalents in a pair of languages, but we 
must find a substantial overlap, as indeed we do in comparing English and Latin. It is against 
this background that I will ^proach the question of grammatical relations in Chinese in section 
3. 

3. Some Thoughts on Grammatical Relations in Chinese 

I now turn to a discussion of the relevance of these remarks to the analysis of Chinese, 
in particular Mandarin. I do so with some trepidation: I am not a specialist in Chinese, and am 
therefore dependent for the most part on secondary data, as indicated in the acknowled^ents 
in the following text, in addition to which I have used Li and Thompson (1981) as a basic ref- 
erence source. What follows should therefore not be taken as a definitive statement, or even as 
my definitive statement, about the relevant aspects of Chinese syntax. Rather, on the basis of 
observations that others have made concerning the syntactic structure of Chinese, I will try to 
place Chinese within the methodological framework that has been outlined in the earlier part of 
this article. 

At first sight, Chinese presents an even more daunting picture than English did to our 
Latin grammarian. In English, at least most personal pronouns have enough case marking to 
distinguish between subjects and direct objects, but in Chinese this is not the case: 

(31) Ta d5 w6. 
he hit I 
'He hits me.' 

(32) W6 d5 ta. 
I hit he 
'I hit him.' 

In English, word order provides a good test for subjects, despite the possible slight variation in 
word order of major constituents of the clause, but in Chinese again the situation is less clear. 
Thus, in addition to the SVO word order of (33), we also find the OSV word order of (34), 



^In this Latin passive clause, in the past tense (more specifically, the so-called perfect tense- 
aspect), the verbal complex is composed of the past passive participle of the lexical verb, agree- 
ing in gender-number with the subject, and the present tense of the auxiliary verb 'to be', 
agreeing in person-number with the subject. 
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and even the SOV word order of (35), with the result that NP NP V sequences are potentially 
ambiguous in Chinese: 

(33) W6 mSi shu-le. 

I buy book-ASP 
'I bought the book.' 

(34) Shu w6 mSi-le. 
book I buy-ASP 

(35) W6 shu mSi-le. 

I book buy-ASP 

It is therefore not suiprising that we find extreme discrepancies in the attitudes of different lin- 
guists in their attitude towards the analysis of Chinese, ranging from those on the one hand 
who argue that the various syntactic distinctions that arc overt in English or Latin are present in 
Chinese, only in covert form, to those who argue that the syntax of Chinese must be organized 
on completely different principles— or even that Chinese has no syntax, its apparent syntactic 
properties being in fact pragmatic. If we restrict ourselves more narrowly to the question of 
grammatical relations, Chinese provides a particularly interesting test case: it lacks the blatantly 
obvious criteria presented by Latin, and even to a large extent by English; therefore, within the 
framework outlined above, the linguist analy2dng Chinese is in the position of having to justify 
carefully any claims made about grammatical relations, as against the counter-claim that gram- 
matical relations are simply urelevant to the structure of Chinese. 

An apparently straightforward sentence type will serve to introduce the problems and 
the kind of argumentation that can be brought to bear, namely the so-nalled double-object con- 
struction, as in the following example: 

(36) Ta g6i w6 yi-b^n shu. 
he give I one-CL book 
'He gives me a book.' 

In this sentence, the verb g^i 'give' is followed by two bare noun phrases. The question that 
arises is the following: Do these noun phrases differ from one another syntactically, in a way 
that is not directly predictable from their semantic roles (recipient and patient, respectively)? 
There are clearly differences between the two noun phrases. For instance, their order relative to 
one another is fixed, so that the following variant is impossible: 

(37) *Ta gli yi-Un shQ w6. 

he give I one-CL book 

It would be hard to argue that this order is determined by pragmatic factors, especially in face 
of the fact that other varieties of Chinese require the order of (37), as in Hakka example (38) 
(cited from Hashimoto 1973: 522), which requires the order with the patient preceding the 
recipient.'' 

(38) Gi2 bun2 jit^-bun^ su^ r)ai2. 
he give one-CL book I 
'He gives a book to me.' 



^In addition to (38), Hakka has another variant in which the verb 'give' is repeated before the 
recipient; but what is crucial for present purposes is that the version given in (38) is possible. 
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In other words, the fact that Mandarin has the order recipient-patient, while Hakka has the or- 
der patient-recipient, seem to be relatively arbitrary facts about these two varieties of Chinese, 
of a kind that would seem prima facie candidates for treatment as syntactic. However, a further 
possibility remains, namely that the distinction could be stated in semantic terms, as was indeed 
done informally above: One would simply specify that Mandarin requires the order recipient- 
patient, that Hakka requires the order patient-recipient, and that would be the end of the matter, 
with no need to refer to any independent notion of grammatica' relation. I will leave the double- 
object construction for the moment, returning to it below in the context of grammatical relations 
and clause combining in Chinese. 

In an knportant contribution to the debate on grammatical relations in Chinese, LaPolla 
(to appear) argues forcefully against the concepts of "subject" and "direct object" in Chinese. 
The kinds of arguments that he adduces can be divided into two classes. First, there are some 
criteria whereby putative subjects and direct objects not only do not differ syntactically from 
each other, but also do not differ from other noun phrases or adpositional phrases; for instance, 
relative clause formation in Chinese allows relative clauses to be formed where the head noun 
functions as subject, as direct object, or as some other argument or adjunct within the relative 
clause. Thus, at least the possibilities of relative clause formation do not provide any basis for 
differentiating between subject and direct object, or between these two together and other puta- 
tive grammatical relations. Secondly, there are some criteria which, as LaPolla shows, fail to 
distinguish between putative subjects and direct objects, but where he does not explicitly note 
whether they distinguish between subjects and direct objects taken together and other major 
constituents of the clause. It is examples of this second kind that I want to concentrate on, in 
particular cross-clause coreference. 

A simple pair of examples illustrating the possibilities, and taken from LaPolla (to ap- 
pear), is given below: 

(39) Nei-ge i6n ba xigua, di&o z^i dishing, 
that-CL man DO watermelon drop LOC ground 

— i sui-le. 

shatter-ASF 

'That man dropped the watermelon! on the ground, 
and iti burst.' 

(40) Nei-ge r^nj b3 xigua diao z^i dishing, 
that-CL man DO watermelon drop LOC ground 

— i huang-le. 

be.flustered-ASP 
'That mani dropped the watermelon on the ground, 
and hei was flustered.' 

In Chinese, either the subject of the first clause, as in (39), or the direct object of the first 
clause, as in (40), can be the overt controller for a null-anaphor in the second clause in this 
construction, where the clauses are joined asyndetically, i.e. without any overt linker, and this 
construction therefore fails to distinguish between subject and direct object. The interpretations 
assigned are, of course, those that are pragmatically most plausible, since normally people do 
not burst and watermelons do not get flustered. But the crucial point is that there is no syntactic 
restriction differentiating between subject and direct object. Examples like those just cited arc 
particularly significant when one compares them with the English translations, involving con- 
junction reduction. English requires that both overt controller and covert target have the same 
grammatical relation, or even more specifically that both be subject, so that while a null-ana- 
phor is possible in (41) below, it is not possible in (42), or rather (42) can only be interpreted 
to mean, however counter-intuitively, that the man burst: 
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(41) That mani dropped the watermelon on the ground, 

and — i was flustered. 

(42) That mani dropped the watermelonj on the ground, 

and — i/*j burst. 

I have already introduced some notational conventions for representing certain aspects 
of cross-clause coreference, and it will be useful to make these explicit. For ease of presenta- 
tion, each clause of a multi-clause sentence will be presented on a separate line, in both the 
original sentences and the English translations. The null-anaphor, or rather the position that 
would normally be filled in the second and subsequent clauses by the overt correspondent of 
the target noun phrase, will be represented by a long dash ( — ). Subscript letters will be used to 
indicate coreference (or, with an asterisk, the impossibility of coreference) between noun 
phrases and either other noun phrases or null-anaphor positions. These conventions are purely 
to illustrate more clearly the structure and meaning of the sentences in question, and no more 
significance than this should be attached to them. In Chinese sentences with asyndetically 
joined clauses, the clauses will be separated by a comma; in English translations, the comma 
will be retained, but where appropriate the conjunction and will be inserted, purely to produce 
more natural-sounding English sentences. 

A detailed study of cross-clause coreference possibilities in asyndetically joined clauses 
is provided by Shi (1989); Shi refers to this construction as the "topic chain", although I will 
stick to the less specific term "asyndetically joined clauses". Like LaPolla, Shi notes that both 
subject? and direct objects can serve as controllers of null-anaphors, as in (43) (subject con- 
troller) and (44) (direct object controller): 

(43) Zudtian wSnshang tai m^i huijia, 
yesterday evening he not retum.home 

— i zhilzai p^ngyou nar, 

stay friend there 
— i jinz5o c^i hufldi. 

this.moming only return 
'Yesterday evening hej didn't return home, 
hei stayed at his friend's, 
and hei only returned this morning.' 

(44) W6 kanshang-le zh^i-ge guniang,, 
I fall.in.love-ASP this-CL girl 

ta ye kanshang-le — i, 
he also fall.in.love-ASP 
— i zuihou hbi ta qi3ngz6u-le. 
finally PASS he take.away-ASP 
'I fell in love with this girlj, 

he also fell in love with hen, 

and finally shej was taken away by him.' 

A further observation, not directly relevant to our current concerns but nonetheless of 
interest in this general connection, is that noun phrases that are overtly topicaiized by being 
preposed can serve as controller irrespective of their grammatical or semantic rel.'uion. The dis- 
tinction between topicaiized and non-topicalized constituents is not so easy to test in the case of 
subjects, which are normally sentence-initial anyway, but with non-subjects the distinction is 
clear-cut. Thus, alongside (44) we have (45), with topicalization of the direct object controller: 
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(45) Zh6i-ge guniangj w6 kanshang-le — i, 
this-CL girl I fall.in.love-ASP 

ta y& kanshang-le — i, 
he also fall.in.love-ASP 
— i zuihdu hhi ta qi3ngz6u-le. 
finally PASS he take.away-ASP 

(Note that I have extended the use of subscripted long dashes and pronouns to include indicat- 
ing the positions that topicalizcd noun phrases would have occupied, had they not been topical- 
ized.) The following example shows topicalization of the object of a preposition, which then 
serves as controller for a null-anaphor: 

(46) Zhangsanj a w6 yizhi dui taj hgn zunjing, 
Zhangsan TOP I always towards he very respectful 

— i dui w6 que z6ngshi bulibdcSi. 
towards I yet always disregard 
'As for Zhangsani, I am always very respectful towards himi, 
yet hei always disregards me.' 

And finally, the controller can be a topicalized indirect object, as in: 

(47) Lisii ma w6 g6i-le tai hSoduo yfiuydng-de jiajd, 
Lisi TOP I give-ASP he much useful-PTCL furniture 

— i y& hH shuo sheng xi6xie. 
also not say CL thanks 
'As for Lisii, I gave himi lots of useful furniture, 
and hei didn't even say thank you.' 

There are at least two alternative analyses that one might suggest at this point, in re- 
sponse to the data presented so far. The first is that the best characterization of the controller is 
simply that it is topic, irrespective of its grammatical relation, or more generally of its role in 
the first clause of die sequence. One would then have to argue, perhaps not implausibly, that 
the controller noun phrases in (43) and (44), though not preposed, arc nonetheless interpreted 
as topics. However, there is evidence against this analysis. It is possible to have a contro' er 
that is an indefinite, clearly non-topical direct object, as in the following example, which could 
indeed be used to introduce the cat into the discourse for the first time: 

(48) W6men m3i-le yl-zhi maoj, 
we buy-ASP one-CL cat 

— i hSn hul zhuo ISoshil, 
very can catch mouse 
dajia dou xihuan tai. 
everyone ail like it 
'We bought a cati, 

iti is very good at catching mice, 
everyone likes iti.' 

As one would expect, it is not possible to topicalize this indefinite direct object, even if it func- 
tions as the controller of a sequence of asyndetically joined clauses: 
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(49) *Yi-zhi mao w6men mai-le, 
one-CL cat we buy-ASP 
— i hgn hui zhuo ISoshd, 
ver>' can catch mouse 
d^jia dou xihuan tai. 
everyone all like it 

The second alternative would be to argue that any noun phrase can be controller of 
cross-clause coreference in a sequence of asyndetically joined clauses. Certainly, ail the above 
examples show is that the various positions mentioned are possible positions for a controller, 
and no examples have been given so far to show that any position is impossible for a con- 
troller. (The ungrammaticaUty of (49) has nothing to do with controller positions m this con- 
struction, rather it illustrates restrictions on topicalizdng noun phrases.) And one thing we have 
learned from the last few decades of syntactic research is that in order to understand the precise 
characterization of a grammatical construction, it is necessary not only to examine grammatical 
examples, but also to examine ungrammatical examples, since only by trying to draw the 
boundary between the two will we come to an understanding of the precise limits of the con- 
struction under investigation. It is therefore cmcial to consider the kinds of examples adduced 
by Shi (1989) to show that some positions are not possible positions for controllers in this 
construction. In particular, objects of prepositions, if not overtly topicalized, are not possible 
controllers of nuU-anaphors in asyndetically joined clauses: 

(50) *Wd yizhi dui Zhangsan h«n zunjing, 

I always towards Zhangsan very respectful 
— i dm w6 que zdngshl billlDilcai. 
towards I yet always disregard 
'I am always very respectfiil towards Zhangsani, 
yet hei always disregards me.' 

Nor is the indirect object in the double-object constmction: 

(51) *W6 g6i-le Lisl hSoduo ySuydng-de ji^jil, 

I give-ASP Lisi much useful-PTCL furniture 
— i y6 bil shuo sheng xifexie. 
also not say CL thanks 
'I gave Lisii lots of useful furniture, 

and hei didn't even say tharik you.' 

This last example is particularly telling, in relation to our earlier rather inconclusive discussion 
of the double-object construction, because here we have a distinction between the direct object 
(patient) and indirect object (recipient) of this construction, but one where we do not need to, 
and indeed arguably should not, refer specifically to the patient, but rather need to subsume this 
under a larger class that includes subjects and direct objects (henceforth: direct grammatical 
relations). Thus, the distinction between the two objects in the double-object construction 
turns out to coincide with a major dividing line between grammatical relations in Chinese. 

The generalizations illustrated by the examples above can be sunmiarized by the follow- 
ing two-part statement: 

(52) The controller of a null-anaphor in a sequence of asyndetically joined clauses can be 

(i) a noun phrase that is topic of the initial clause; 

(ii) a noun phrase that occupies a direct grammatical relation (subject or direct object) of 
the initial clause. 
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Notice that, in keeping with LaPoUa's generalization concerning subjects and direct objects in 
Chinese, we have so far provided no evidence for drawing a dividing line between subjects and 
direct objects. Rather, the evidence presented so far suggests a single grammatical relation of 
"direct". 

The next question to ask is whether there is any evidence for distinguishing between 
subject and direct object in their syntactic behavior. At this point, the data and their interpreta- 
tion become less clear to me, so I will content myself with presenting some relevant data and 
commenting on their possible significance. Shi (1980) extends his characterization of the class 
of possible controllers by arguing that the controller can also be, in his terms, the specifier of 
the subject, i.e. roughly a possessor within the subject noun phrase, as in the following exam- 
ple: 

(53) Tai-de t6u ySudianr t6ng, 
he-GEN head a.little hurt 

— i dilzi ydu btl shufu, 
stomach also not good 
— i m^iy6u chi wSnfln. 
not-ASP eat dinner 
'Hisi head is hurting a little, 

hisi stomach is also not good, 
and hei hasn't eaten dinner.' 

This example can, of course, also be presented with overt topicalization of the possessor noun 
phrase, though this is not necessary: 

(54) Tai t6u ySudiSnr t6ng, 
he head a.little hurt 

— i dilzi ydu bil shufu, 

stomach also not good 
— i m^iy6u chi w5nmn. 

not-ASP eat dinner 

Shi claims moreover that it is not possible for the controller to be the specifier of the direct ob- 
ject noun phrase. This suggests that there is indeed a difference between subjects and direct 
objects, but one shown in a remarkably subtle way: It is not shown directly by differential be- 
havior of subjects and direct objects, but rather by the differential behavior of noun phrases in- 
ternal to the subject and direct object noun phrases. But before accepting this conclusion, I 
think that it is necessary — and I pose this as a research task for specialists in the Chinese lan- 
guage — to investigate how general the construction type illustrated by (53) is in Chinese. The 
examples that are usually presented are from a quite narrow semantic range, so that one might 
try to argue that the relevant factor is not so much syntactic ("being specifier of the subject 
noun phrase") as semantic. At any rate, these are intriguing data that merit further detailed in- 
vestigation. 

Another set of data relevant to distinguishing between subject and direct object concerns 
sequences of clauses where there is an overt coordinating conjunction, such as bingqiS 'and'. 
In a pair of asyndetically conjoined clauses like (55), there are two possible interpretations, 
since the controller can be either the subject or the direct object: 
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(55) Lisij mai-le yi-zhi g6uj, 
Lisi buy-ASP one-CL dog 

— i/j tdoz6u-le. 
flee-ASP 
'Lisii bought a dogj, 

and hei/itj fled.' 

If, however, the two clauses are joined by means of an overt conjunction, the only possible 
interpretation is with the subject as controller of the nuU-anaphor in the second clause: 

(56) Lisij mai-le yi-zhi g6uj, 
Usi buy-ASP one-CL dog 

bingqiS — i/*j t^oz6u-le. 
and flee-ASP 
'Lisii bought a dogj 

and hei/*itj fled.' 

fi-ima facie, such data seem to provide evidence for the subject-object distinction, and are cited 
as such by, for instance, Shi (1989: 243-244), who explicitly draws the contrast between 
clauses linked by a conjunction and those linked asyndetically. One objection that I have heard 
to such data is that they are not "natural" in Chinese, in contrast to the asyndetic linkage; but 
even so, it is not clear why this construction, even if innovatory, should be subject to the con- 
straint that the controller (and, irdeed, the target) must be subjects, with the implication that 
Chinese has created a distinction between subjects and direct objects just in order to constrain 
this new construction. No doubt one could continue for a long time with such speculations; I 
prefer to leave further investigation of this construction to those who know Chinese better than 
I do. 

4. Conclusion 

A linguist who is familiar with the structure of Imguage A and who approaches the 
analysis of a very different language B might adopt one of two extreme positions, in addition to 
a number of intermediate positions. One extreme would be to assume that language B is going 
to have essentially the same structure as language A, so that features of lang-iage A that arc not 
apparent in language B will be held to be "covert" features of the structure <.,f language B. The 
opposite extreme is to assume that only "overt" features of language B are relevant to its analy- 
sis, thus denying any relevance of properties of language A that are not immediately apparent in 
language B. I have tried to show that both approaches are likely to be misguided. There arc 
deep-seated differences between languages of different types, and simply carrying over the 
structure of one language to another language can lead us to misanalyze language B, to apply 
categories to it that are simply not relevant to its structure. On the other hand, insisting solely 
on cataloguing observables can equally lead to a loss of insight in the analysis of language B, 
since there may well be properties of language B that can only be demonstrated by more de- 
tailed analysis, but which can nonetheless be clearly demonstrated by such detailed analysis. 
The crucial point is that whatever analysis we propose for language B, we must be able to jus- 
tify that particular analysis for language B. In some cases, phenomena that we find in language 
A will turn out to be relevant in describing lansuage B, even though the precise criteria serving 
to delimit those phenomena may be different in the two languages. In yet other cases, language 
B will have nothing comparable to some phenomenon found in language A, and will simply 
have to be described without that category. And in yet other cases, perhaps the most interest- 
ing, language B will turn out to have some phenomenon that is similar but not identical to what 
we find ui language A, so that careful detailed analysis will be needed to establish the precise 
similarities and differences between the two languages. I have used Chinese data to try and il- 
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lustrate some of these possibilities. However, i should emphasize that my interest has not been 
in advocating a particular analysis of Chinese — it would have been rash indeed for me to have 
done so— but rather to encourage investigators of the Chinese language to consider possibilities 
in addition to those that arc suggested on the one hand by European languages, on the other 
hand by the absence of "obvious" grammatical criteria in Chinese. Finally, I should perhaps 
emphasize that the different approaches to language-universality and language-specificity that I 
have outlined can be found within linguistic schools that are otherwise quite different from one 
another. Thus, within generative grammar we have seen a move away from the view that all 
languages are essentially the same by the introduction of the radical distinction between config- 
urational and non-configurational languages (e.g. Hale 1983) — ^with more recently a trend back 
towards the view that all languages are more sunilar to one another. And much of the detailed 
work on grammatical relations from the typological perspective can be seen as a reaction to the 
rule of thumb (and perhaps no more!) adopted in one of the pioneering works of this approach, 
Greenberg (1966), that grammatical relations can be identified intuitively on the basis of trans- 
lation equivalents. The possibilities for cross-linguistic variation arc something that every lin- 
guist needs to know about. 
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Abstract 



Chinese is shown to be a discourse accusative language in which there is a strong 
discourse pressure uniting S and A in that they both introduce given information in to 
discourse. The pairing together of S and A motivates a nominative category, namely a 
category that marks topical information, while new information is introduced into the 
discourse in the O or oblique role. Moreover, data on anaphoric links across successive 
clauses show that S/A links far outnumber S=0 links. Thus the topic continuity 
dimension also defines for Chinese a nominative/accusative {S, A} alignment. 

Word order in Chinese is shown to be more sensitive to valency role than to 
discourse pragmatics, though both factors are highly predictive of word order. The 
present study suggests that Chinese is aligned with other type C language (e.g. 
Norwegian and English) where the only device available for expressing semantic (e.g. 
that of the agent) and pragmatic functions (e.g. that of topic) is linear order. However, 
Chinese differs from these other type C languages in that when there is a conflict between 
semantics and pragmatic functions, type C languages typically resolve the conflict by 
availing themselves of a syntactic role changing process (e.g. passive), while Chinese 
typically relies on a complex interplay between semantics and pragmatics for its 
resolution. 
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1. Introduction 



It is now widely agreed that discourse pragmatics provides much of the substance 
of grammar. A primary assumption that emerges functional linguistics is that grammar is 
discourse driven and that grammar is motivated in large part by functional considerations. 
Language derives its particular organization from the ecological setting in which it 
functions and grammar emerges from recurrent patterns in discourse constrained by 
cognitive and communicative aspects of human behavior (Hopper 1987, 1988). Grammar 
is an open-ended set of regulations that are constantly being resystematized through use. 
As Du Bois (1985:363) has put it, "grammars code best what speakers do most." This 
process, known as grammaticalization (or syntacticization), is the primary explanatory 
assumption that must be made to come to grips with the nature of grammar. Focusing on 
the grammaticalization process as manifested in discourse frequency forces us to the 
conclusion that the mechanism by which grammar emerges must be sought in the 
recurrent patterns in the way people track referents, negotiate information flow and 
determine how to get their points across. Recent studies have shown that convincing 
account of grammatical facts cannot be given unless discourse structural facts are 
considered. For example, insights into grammatical relation, word order, anaphora, 
topicality and evidentiality, among others, are particular noteworthy. Grammar can't be 
just a fixed mental system recoverable from an examination of the mind of a single 
individual, but is an activity embodied and constituted in the pragmatics of 
communication (Hopper and Thompson 1991). 

In this paper we seek to shed light on the pragmatics of word order in Chinese. 
We will show that Chinese is a discourse accusative language where there is a strong 
pressure uniting S and A in that they both introduce given information into discourse. 
The pairing together of S and A motivates a nominative category, namely a category that 
marks topical informacion, while new information is introduced into the discourse in the 
O or oblique role. Moreover, data on anaphoric links across successive clauses show that 
S/A links far outnumber S/0 links. Thus the topic continuity dimension also defines for 
Chinese a nominative/accusative {S, A} alignment. 

Word order in Chinese is shown to be more sensitive to valency role than to 
discourse pragmatics, though both factors are highly predictive of word order. The 
present study suggests that Chinese is aligned with other type C language (e.g. 
Norwegian and English) where the only device available for expressing semantic (e.g. 
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that of the agent) and pragmatic functions (e.g. that of topic) is linear order. However, 
Chinese differs from these other type C languages in that when there is a conflict between 
semantics and p.agmatic functions, type C languages typically resolve the conflict by 
availing themselves of a syntactic role changing process (e.g. passive), while Chinese 
typically relies on a complex interplay between semantics and pragmatics for its 
resolution. 

Below we will first present facts of word order in Chinese and facts of the 
semantics and pragmatics of nominal arguments. We will then interpret the correlation 
between word order and discourse-pragmatic functions. Finally, we will explain 
distributional correlations. 

2. The data 

The corpus for this paper comprises one ordinary conversation and two oral 
narratives. The conversation, a spontaneous dining-table among one female and three 
males, centered around the people and the events taking place in the office where the 
participants worked. The conversation ran to 40 minutes long and totaled 1049 clauses 
(clause fragments not counted), where clauses are defined as a predicate element together 
with its argument(s). 

The two narratives were retellings of the stories about the Ghost film and the Pear 
film. The Pear narratives of different languages have been employed for infonnation 
flow studies, such as Chamorro (Scancarelli, 1985), Japanese and English (Iwasaki, 
1985), and Sacapultec (1987). The Chinese Pear narratives were produced in 1976 by 
eighteen female speakers who were undergraduate students at National Taiwan 
University. After they had seen the 15 minute-long film, each was taken individually into 
a room where a female interviewer, not an acquaintance of the interviewees, explained 
that she had not seen the movie and asked the speaker to recount to her the story of the 
film. These data formed the basis of an earlier study on referential choice in Chinese 
(Chen 1986). 

The Ghost narratives were obtained in an analogous fashion, though there were 
differences in detail. First, Ghost was a full-length film lasting 127 minutes. Secondly, 
four narrators, two females and two males, had already seen the film for over one year 
before the taping sessions in the speech laboratory of their office, where they recounted 
the film to the interviewer. Since the narrators and the interviewer were office mates 
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rather than strangers, their narration was produced in a more 'relaxed' and 'natural' 
fashion. 



Table 1. summarizes the recording time and the number of clauses produced in 
each text. Only clauses with overt or covert but recoverable arguments are tabulated in 
the count; clause fragments are excluded from the tabulation. 



It is of interest to note that of the clauses produced, main clauses predominated in 
each text, at something like 95%, and that there is no significant difference in percentage 
between any two types of texts, though one might have expected the conversation text, 
with frequent topic changes and less dwelling on details of events, to have a much higher 
proportion of main clauses. By contrast. Ghost and Pear there would be much more of 
the tellers' attempt to successfully reconstruct the stories of the films, which, in the case 
of Ghost, contain a significant amount of the portrayal of the inner world of the 
protagonists and therefore would seem to call for a wider-ranging use of mental and 
psychological predicates with accompanying sentential complements. 

3. Valency role orders 

In the following discussion, surface core arguments A, S, and O will be termed 
valency roles instead of the more usual syntactic roles (e.g. Payne 1987). Following 
Dixon (1979), A and O refer to the topical agent and patient of a transitive verb and S the 
single argument of an intransitive verb. The text data show that a strongly preferrerJ order 
of nominal arguments, relative to the verb, can be identified. Table 2. presents various 
valency role orders for clauses that have two, one, or zero overt arguments. 



conversation 

Ghost 

Pear 



Recording time (min.) 



38.8 
36.1 
41.7 



Clauses 
1049 
621 
1000 



Main clause 
1006 (95.9%) 
580 (93.4%) 
965 (96.5%) 



Table 1 . Summary of data. 
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AVO 


337 


AOV 


51 


OAV 


8 


VO 


249 


OV 


85 


AV 


28 


SV 


536 


VS 


126 


V 


273 


total 


1679 



Table 2. Distribution of various valency role orders. 

As shown in Table 2., the most common clause types in discourse are AVO for 
transitive clauses and SV for intransitives. Furthermore VO order outnumbers OV order 
by a 4.5:1 ratio (586:130). 

Table 3. shows the distribution of clauses with zero, one, and two missing 
arguments in relation to transitivity. 



number of missing arguments 


Transitive 


Mifansitive 


Total 


0 


382 


662 


1044 


1 


362 


269 


631 


2 


4 




4 


total 


748 


931 


1679 



Table 3. Distribution of clause types as a function of missing arguments. 



As seen in Table 3., there is an unmistakable tendency for clauses to have one less 
overt core argument than the number allowed. 

Table 4. presents cross-tabulations of valency rolts (A, S, and O) with pre- vs. 
postverbal position for clauses with one and two overt core arguments in the conversation 
text. Tables 5. and 6. present the same cross-tabulations for the Ghost and Pear texts. 







A 




S 




0 


total 




N 


% 


N 


% 


N 


% 


N 


Preverbal 


119 


100.0 


227 


93.4 


9 


7.6 


355 


Postverbal 


0 


0.0 


16 


6.6 


110 


92.4 


126 


total: 


119 


100 


243 


100 


119 


100 


481 






(X2 = 


360.66, d.f. = 2, 


p<.01; 


(t) = .86) 





Table 4. Syntactic role and pre- vs. postverbal order (conversation). 
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A 


s 


O 


total 




N 


% 


N % 


N % 


N 


Preverbal 


110 


100.0 


121 88.3 


8 7.3 


239 


Postverbal 


0 


0.0% 


16 11.7 


102 92.7 


118 


total: 


110 


100 


137 100 


110 100 


357 






(X2 = 


259.61,d.f. = 2,p< .01;(t) = .85) 




Table 5. 


Syntactic role and pre- vs. postverbal order (Ghost). 






A 


S 


0 


total 




N 


% 


N % 


N % 


N 


preverbal 


153 


100.0 


188 66.7 


28 18.3 


369 


postverbal 


0 


0.0 


94 33.3 


125 81.7 


219 


total: 


153 


100 


282 100 


153 100 


588 






(X2 = 


222.05, d.f. = 2, 


p< .01;(t) = .61) 





Table 6. Syntactic role and pre- vs. postverbal order (Pear). 



Tables 4-6 show that pre- versus postverbal order of nominal arguments are 
strongly associated with valency roles. Not only are the values highly significant, but 
the (]) statistics also show a very strong association between form (pre- vs. postverbal 
order) and function (valency roles). The significance of the association results primarily, 
especially in conversation and Ghost, from the strong placement in the preverbal position 
of A and S and the nearly equally strong placement in the postverbal position of O. 

Tables 4-6 also show that as expected, there is an ambiguous one to many 
correlation from form to function, since the preverbal position may be either A or S, 
though the postverbal position is nearly categorically O. The very same data in Tables 
4-6 can be rearranged, as correlations from function to form. When this is done, it can be 
readily seen that there is a near-categorical tendency for A and S to appear in preverbal 
position, and O in postverbal position, since the mean prediction rate is a respectable 
95.2%. 

4. Word order and information status of NPs 

In the following discussion, two activation states are distinguished for NPs. A 
nominal is considered given if it has already been activated at the point in the speech act 
where the nominal appears. 'New' referents refer to any nominals that are not given. 
Accessible referents that have nol been mentioned but are frame-based are treated as new 
information in this paper. 
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Table 7. presents cross-tabulations of activation states of nominal arguments 
(given, new) with pre- vs. postverbal position in the conversation text. Tables 8-9 
present the same cross-tabulations for the Ghost and Pear texts. 





Given 


New 


total 




N 


% 


N 


% 


N 


Preverbal 


314 


88.5 


41 


11.5 


355 


Postverbal 


74 


58.7 


52 


41.3 


126 


total: 


388 




93 




481 



(X2 = 52.67, df. = 1, p<.01; <t) = .33) 
Table 7. Information status and order (conversation). 





Given 


New 


total 




N % 


N % 


N 


Preverbal 


224 93.7 


15 6.3 


239 


Postverbal 


56 47.5 


62 52.5 


118 


total: 


280 


77 


357 




(X2 = 99.96, d.f. = l,p<.01;(t) = 


.53) 


Table 8. Information status and order (Ghost). 




Given 


New 


total 




N % 


N % 


N 


Preverbal 


333 90.2 


36 9.8 


369 


Postverbal 


89 40.6 


130 59.4 


219 


total: 


422 


166 


588 




(X2= 166.91, d.f. = l,p<.01;<t) 


= .53) 



Table 9. Information status and order (Pear). 



Tables 7-9 show that pre- versus postverbal order of nominal arguments are 
strongly associated with their information status. However, all three of the (j) statistics 
show the association to be much weaker than that between word order and valency roles. 
They also show that, again as expected, there is an ambiguous one to many correlation 
from form to function, since the postverbal position may equally be given or new in all of 
the three texts, though there is a near-categorical tendency for the preverbal position to be 
given. 

The very same data in Tables 7-9 can be recomputed, as correlations from 
function to form. When this is done, it can be readily seen from Tables 10-12 that the 
function-form mapping is hardly ambiguous: both the 80:20 ratio vs. 19.5:80.5 ratio for 
the Ghost text and the 79:21 ratio vs. 26:74 ratio for the Pear text are in the same 
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direction. However, it can be easily determined that the mean prediction rate (for 
predicting word order on the basis of given vs. new irformauon) is just 75%, which is not 
only lower than the 95.2% prediction rate for vale-cy roles cited earlier, but also lower 
than the normal value of 80%~90% for code fideh y (Giv6n 1992). 





Preverbal 


Postverbal 


total 




N 


% 


N 


% 


N 


Given 


314 


80.9 


74 


19.1 


388 


New 


41 


44.1 


52 


55.9 


93 


total: 


355 




126 




481 



Table 10. Information status and order (conversation). 





Preverbal 


Postverbal 


total 




N 


% 


N 


% 


N 


Given 


224 


80 


56 


20 


280 


New 


15 


19.5 


62 


80.5 


77 


total: 


239 




118 




357 



Table 11. Information status and order (Ghost). 





Preverbal 


Postverbal 


total 




N % 


N % 


N 


Given 


333 78.9 


89 21.1 


422 


New 


36 21.7 


130 78.3 


166 


total: 


369 


219 


588 



Table 12. Information status and order (Pear). 



We have shown, then, that word order in Chinese is far more sensitive to valency 
roles than to activation states (given, new) of nominal arguments. What remains to be 
demonstrated is whether word order is also sensitive to other dimensions of discourse 
pragmatic information. This will be attempted in the following section. 

5. Word order, identifiability and generality 

Identifiability is a discourse category used to characterize the speaker's 
assumption about whether a particular referent can be identified by the hearer. An NP is 
identifiable if the speaker indends and believes that the hearer can mentally tag the 
information as identifying a particular referent which will have continuous identity over 
time. A non-identifying expression is one which is either non-referential or the speaker 
believes the hearer cannot tag the information as identifying some particular identities. 
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Generality as a discourse property concerns whether a referent refers to a particular entity 
(particular) or a class of entities (generic). 



Table 13. presents cross-tabulations of activation states, identif lability and 
generality of nominal arguments with pre- versus postverbal O. Table 14. presents the 
same cross-tabulations for pre- versus postverbal S. 





Pi^verbal O Postverbal O 


total 




N 


% 


N 


% 


N 


Given 


113 


86.3 


449 


61.1 


304 


New 


18 


13.7 


286 


38.9 


562 


total: 


131 


100 


735 


100 


866 




(X2 


= 30.9, d.f. 


= 1.P 


<.01;<{» = 


.19) 


Identifiable 


112 


85.5 


535 


72.8 


647 


Non-identifiable 


19 


14.5 


200 


27.2 


219 




(X2 


= 9.35, d.f. 


= 1.P 


<.01;<{.= 


.10) 


Generic 


13 


9.9 


244 


33.2 


257 


Particular 


118 


90.1 


491 


66.8 


609 




(X2 = 


= 29.05, d.f 


.= 1.P 


<.01;<{» = 


.18) 


Human 


27 


20.6 


225 


30.6 


252 


Non-human 


104 


79.4 


510 


69.4 


614 




(X2 


= 5..27, d.f 


= 1.P 


<.01;<{» = 


.08) 



Table 13. Pragmatic and semantic categories and the order of O. 





Preverbal S Postverbal S 


total 




N 


% 


N 


% 


N 


Given 


812 


92.5 


39 


31 


851 


New 


66 


7.5 


87 


69 


153 


total: 


878 


100 


126 


100 


1004 




(X2 = 


327.3, d.f. 


= 1.P 


< .01; <{. 


= .57) 


Identifiable 


819 


93.3 


30 


23.8 


849 


Non-identifiable 


59 


6.7 


96 


76.2 


155 




(X2 = 


= 419, d.f. 


= 1.P 


<.01;<{» = 


= .65) 


Generic 


83 


9.5 


27 


21.4 


110 


Particular 


795 


90.5 


99 


78.6 


894 




(X2 = 


15.46, d.f. 


= 1.P 


<.01;<{» 


= .12) 


Human 


691 


78.7 


89 


70.6 


780 


Non-human 


187 


21.3 


37 


29.4 


224 




(X2 = 


= 4.24, d.f. 


= 1.P 


< .05; <{» 


= .06) 



Table 14. Pragmatic and semantic categories and the order of S. 

A number of significant results emerge from Tables 13. and 14. First, the 
semantic category humany'non-humans has little predictive value for order. Secondly, 
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generality is also a poor predicator of word order. Thirdly, activation state and 
identifiability work in parallel in the predictive success with word order. In other words, 
word order can be equally well or equally poorly predicted on the basis of either 
activation state or identifiability. Thus they are equally strong predictors of pre- vs. 
postverbal S, but equally poor predictors of pre- vs. postverbal O. These results taken 
together suggest that morphological types of nominal arguments in Chinese are used 
essentially to only encode either the more linguistic context-bound activation states or 
identifiability, which has a stronger mix of extra-linguistic components, since pathways 
to identifiability include not only previous mention, but also situational settings and 
invoked frames. 

Tables 12 and 13 further show that the structural split is strongly motivated by 
discourse pragmatics, but the structural split of O much less so. The prediction rates from 
pragmatic categories to pre-/postverbal S order are respectively 80.8% for activation state 
(given/new), and 84.8% for identifiability. If we believe with Giv6n (1992) that the 
perceiving mind needs a code fidelity somewhere about or above the level of 80% 
prediction rate for it to begin to bet on a 100% categorical distribution and ignore the 
margins, then Chinese can be said to have nearly grammaticized the scalar distribution of 
various pragmatic properties of nominal arguments in pre- vs. postverbal S position as 
identifying a grammatical existential construction. But the same cannot be said of the 
structural split of O. The low 0 statistics in Table 13. mean that there would be little 
predictive success from function to form. Thus the prediction rate from activation state to 
order is a mere 57.5%, and that from identifiability to order 56.3%, both of which are at 
chance level. The prediction rate from particular to order is higher, at 61.5%, which is 
still nowhere near the threshold 80% of code fidelity required of categorical distribution. 

6. Topicality hierarchy 

The present date indicate that 98% of the clause-initial position is preempted by 
an NP which is either an A (and hence categorically topic of the clause) or i (preverbal) S 
(and hencce also categorically topic of the clause). The clause-initial NP then represents 
a convergence of semantic ("role") properties of agent and the pragmatic ("reference") 
properties of clausal topic. A and S are much more topical than O not only in their 
greater propensity to preempt the clause-initial position but also in their stronger 
tendency to form continuous anaphoric links across successive clauses. To measure topic 
continuity across different valency roles, anaphoric links across adjacent clauses are 
tabulated according to the valency roles in which the co-referential referents occur in the 
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two clauses'. For example, the pronominal la 'she' in (1) appears in the A role in clause 
(a), but re-appears in the succeeding clause (b), then this particular link across A and S is 
tabulated as an instance in the linkage class of 'A-to-S'. 

(1) a. -->Y: ...yinwei ta yiqian,_ 

because 3.SG previously 
'Because she previously, 

conglai mei you zhe ge jingyan a.\ 

ever NEG have this CL experience PRT 
did not ever have this kind of experience, 

b. --> ... suoyi% suoyi na ge nude jiu xiayitiao.\ 

so so that CL woman thus be.frightened 

so that woman was frightened." (GHOST 4:207-10) 

Tables 15-17 present the distribution of various types of anaphoric links for A, S, 

and O. 



A-to-A type 
A-to-S type 
A-to-0 type 
total: 



N % 

400 63.3 

201 31.8 

31 4.9 

632 100 



Table 15. Anaphoric linkage of A (After Chui 1994:64-65). 



Preverbal Postverbal 





N 


% 


N 


% 


S-to-S type 


208 


47.1 


7 


19.4 


S-to-A type 


210 


47.5 


25 


69.4 


S-to-0 type 


24 


5.4 


4 


11.1 


total: 


442 


100 


36 


100 



Table 16. Anaphoric linkage of S (X2=10.53) (After Chui 1994:104-105). 



Preverbal Postverbal 
N % N % 



0-to-Atype 5 17.2 40 25.5 

0-to-Stype 5 17.2 40 25.5 

0-to-Otype 19 65.5 77 49.0 

total: 29 100 157 100 

Table 17. Anaphoric linkage of 0 (X2=2.56) (After Chui 1994:84). 
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Since the total number of nominals in A, preverbal S, postverbal S, preverbal O 
and postverbal O are respectively 1214, 878, 126, 131 and 735, we can easily determine 
that the percentages of various valency roles forming anaphoric links are shown in Table 
18. 



all texts 


Ghost & Pear only 


52% 


632/1214 


59% 


484/819 


50.3% 


442/878 


57.2% 


290/507 


28.6% 


36/126 


31% 


34/110 


22.1% 


29/131 


22.6% 


26/115 


20% 


147/735 


21.4% 


104/486 



A 

preverbal S 
postverbal S 
preverbal O 
postverbal O 

Table 18. Percentage of valency roles forming anaphoric links. 

As shown in Table 18., A and preverbal S are, as expected, the most predictable, 
continuous and topical valency roles, followed by postverbal S, pre- and postverbal O. 
X2 tests show that there is no significant difference either between the first two roles or 
among the last three roles. One may thus suggest the following topicality hierarchy for 
valency roles defined in terms of their ability to form anaphoric links: 



(2) 



preverbal S } ^ ^ 



postverbal S 
preverbal O 
postverbal O 



} 



Since A and preverbal S are overwhelmingly given (97.4% for A; 92.5% for 
perverbal S), identifiable (94.8% for A; 93.3% for preverbal S) and human (94.6% for A; 
78.7% for preverbal S), they are therefore the most continuous and predictable 
arguments. By contrast, all other valency roles are by implication should rank lower on 
the topicality hierarchy in an asymmetric 'figure-ground' sort of distinction, since if 
preverbal valency roles are maximally topical, then one would expect postverbal valency 
roles to be maximally non-topical. It is easy to see why this should be the case. In a 
language with a preferred word order of AVO, the postverbal O position is where 
relatively incidental or unimportant information to the development of the narrative goes 
(the so-called "unimportant information last principle"). Nominal arguments that appear 
there tend to be be new, non-identifiable and/or non-human. On the other hand, 
postverbal S is where new entities are introduced into discourse for the first time, often at 
points of higher thematic discontinuity (e.g. at paragraph or topic chain boundary points), 
reserving the\ preverbal S position for the more topical, identifiable and/or human 
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referents. Most of the postverbal initial mentions on S have little thematic continuity (i.e. 
are not needed in subsequent discourse), just as most of the postverbal mentions on O do. 

But why should preverbal O rank lower on the hierarchy just as postverbal O and 
postverbal S do? We have shown above that the structural split of O is hardly motivated 
by discourse pragmatics. On the other hand, the low ^ statistics already tell half of the 
story. On the other, the 86.3:61 ratio versus 13.7:39 ratio for given/new, or the 85.5:72.8 
ratio versus 14.5:27.2 ratio for identifiablity, or 79.4:69.4 ratio versus 20.6:30.6 ratio for 
humanness (see Table 14.) are all in the same general undraraatic direction. It should not 
be surprising therefore that both postverbal O and preverbal should have turned out to 
rank equally low on topicality hierarchy. 

If our interpretions of preverbal O, based on the distribution of various discourse- 
pragmatic dimensions of information encoded on NPs, are on the right track, then they 
should seriously call into question the proposal by Sun and Giv6n (1985) that OV word 
order in Chinese is a contrastive, topicalizing device. 

7. "Subject"-like properties in clause-initial position 

If we disregard positional role differences in S or O, and if we sum all of the 
various types of anaphoric hnks (see Tables 15-17) regardless of directionality, we find 
that the most preferred links are either identical links with A, S (i.e. A/A, S/S), or non- 
identical S/A links, as Table 20. shows: 



Anaphoric types 


N 


% 


A/A 


400 


32.5 


S/S 


208 


16.9 


0/0 


77 


6.3 


S/A 


411 


33.4 


S/0 


64 


5.2 


A/0 


71 


5.8 


total: 


1231 


100 



Table 19. Types of anaphoric links (After Chui 1994:140). 

The results shown in Table 19. suggest that co-reference across adjacent clauses is 
fairly independent of valency roles, since any anaphoric link is possible. However, since 
82.8% of the links (A/A, S/S and S/A) are co-reference under identity of primary topic (A 
or S), the clause-initial NP position in Chinese, which represents a convergence of 
semantic ("role") properties of clausal topic, has thus a reference-related 'subject' property 
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characteristic of language where subject is syntactically important. Furthermore, the fact 
that S/A anaphoric links are the most common preferred way of forming anaphoric links 
suggest that a S/A semantic pivot in the sense of Foley and Valin (1984:119) has 
emerged, a pivot which neutralizes the semantic distinction between S and A. Chinese is 
thus unlike Eastern Pomono, a strict active-stative language whose switch-reference 
system seems to monitor semantic roles of actor and undergoer directly, not the more 
abstract S/A semantic pivot (Foley and Valin 1984:121). However, the fact that 11% of 
the links are of the S/0 and A/O types and that there was a near total absence of the M 
sentences from the data (there being only 3 out of 2551 clauses, or just 0.1%) strongly 
suggest that changes in the pragmatic role of a nominal in Chinese does not tend to 
changes in "syntactic role", and that Chinese does not have the kind of pivot system like 
that found in either English or Dyirbal where the choice of pivot is largely governed by 
the demands of topicality and cross-clause linkage under coreference, hence necessitating 
the use of passive or antipassive construction to permit alternative choices of pivot when 
deK:anded by context. Chinese appears, then, to be opting for the "Philippine style" 
solution, whereby the overall order is highly sensitive to both valency roles and 
pragmatic information, and yet preverbal S/postverbal S order is stongly influenced by 
the pragmatic properties of the nominal arguments. A strictly reference-prominent (or 
subject-prominent) language would have opted for a more 'unified' syntactic treatment of 
S rather than a structural split of S motivated by semantic or discourse-pragmatic 
considerations. 

To summarize, Chinese represents a language where semantic role and pragmatic 
reference have been more or less grammaticized (given syntactic encodings) and are 
expressed by the same means, linear order, but there is no, or rarely, role-changing 
morphosyntactic process. Such a language tends to discourage a patient from being a toic 
or taking the clause-iniial position to avoid the conflict between an agentive topic and a 
non-agentive topic. This is indeed the case in Chinese. Table 2. shows that OAV 
accounts for just 0.4% of all clause types and that the so-called topic comment 
constructions are practically non-existent. (There is none in the present data). 

A/S in Chinese has acquired some 'subject' properties, though still not all of the 
subject properties characteristic of subject-prominent languages (e.g. subject-verb 
agreement), so that it is still a category of "grammaticized topic" in the sense of Comrie 
(1988), distinct from topic and from subject. 
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Even though Chinese lacks a syntactic category that can be identified with the 
subject category of other languages, it offers some insights into why the subjects of other 
languages have the syntactic properties they do. 



1 
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(Jerry Norman, 1988:1 l)#Mffl-bf@MMi}#l[5t55^|ff«6^+y\'fIit§W • ri 
IM6^mSU5^^i-f^ (classical) m^^ft (modem) ^^Jj^JSS ° iltji^M^mS : 

(1) i§^g|^^ti56^» 

(2) Wil « 

(3) ^W^t^S» 

(4) f|^i^m^>^m^>Sil» 

(5) i^^^^Iffl*i5J» 

(6) A-N isl0 ; A f^^adjective(Jf^^i5l) - N f^^noun(;ai5l) « 

(7) SVOi5]0 o 



- ^ ar]ffi(6)2jjm(6' ) : 

(6* )1^|ff5H§--4''L^igpJ}^ » 



2 3 4 5 6 7 




+ + + + + + + 



+ ?- + - + + 



2 



+ 



+ 



+ 
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+_.___ _(6) - 
mmm - + - + - -ce) - 



pjTH^iftmg : (4)^^mi^0wii • {^mmm~-^'Lmm • (7)svoi5i 

2.1 ±SrIig6^mi 

(denominative) : -sgSjm • ^tlfflm^lBKb : MWS^f ^^S^#J^I?!c6^1ii5]^ • ?f 

2.1.1 ^mit-smmmm- 

ri si 

*C-rjang 'grang ^gi] ft* *C-ijangs grangs 

*tjak 'thag *tjjks thags ^f^p°p 
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2.1.2 mmim^mits-mmmm 



[111 *dj»ns > dzjuSn Dl| *sdj»n > zjuSn 

^ *mjiat> mjat ^ *smjiat> xjwat 



^ *mak > m»k 
# *C-rjam> lj»m 
Ji *ngjans > ngjw 

2.1.3 mm^mm 

*p- > p- : *b- > b- 

*tj- > ts'j- : *dj- > z'j- 

*trj->tj- : *drj->^j- 

*k- > k- *g- > y- 



M *sm»k 


> x»k 


^ *srj»m 


> §j>m 


^ *sngjans > xjwi 


mm 









































2.1.4 ms^mi 

ftii$(«t4^iii§!5:^^i^^3i»ii7-i20Mtr-:5i5/r' •"^/^" ^"mm" '"m/m"^ 
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mm ^ mm^iimnm±-&mmmm n- mm > am#^.ii±i^ -r- 

2.2 mmm-^'Lmm 
mmmm^i ° m'^mu«m^ ^w^»{m%) • 76-77) 

° mm^x^wK-m • $p"if ^" ^ ^ "Bm" ^ "i>fj"ciiiM$t) ° 

Greenberg's Universal 18 (Greenberg, Joseph. Universals of Language (second edition, 1966), 
p.86): 

When the descriptive adjective precedes the noun, the demonstrative and the numeral, 
with overwhelmingly more than chance frequency, do likewise. 

NA AN 
Num.-Noun 8 10 

Noun-Num. 1 1 0 
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Hi^mzm^m ° ci^ ■ mm) 
WcmmM • $n"-mis • -mam • mm) • "-m^m-mm"im. • -^^t) • m 
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2.3 ±^[SOV]l5]061]Mir^ 

2.3.1 [Neg + Pro + V] [Qq + V] 

tt^S^JlgiiJl^^tNeg + Pro + V] ?P [Qo + VjfPMtSOVJiHlJ^fiUitit^ ° 
" ^"S" ^"M" ^"g" ° 

gijjE] : r-mai^ ° (III ' 9tm) 

nm^^ ' *^wti2 ° • 1^14^) 
nm^^mm ' mm "Mm) 
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mnnm ° cit • m • mm) 

giii: ? ? (Ira ' 

Alt) 

? wfi^wti ° (ft ' hm) 



2.3.2 fn"^j.ffl" 

>(< ^ Vt + ^ > 'A- \ Z ^ > 'BVt 
fB ^ Vt t ^ > IS i ^ + Vt > -Wi 



2.3.3 (S I ' VI ^11 (Pro + V) 
[Pro ^ V) 
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mzm^ m^mm - mmmm ° «<§# - iufi - mm» ; m:R#:R 



. ^it ; "Its" . : "SASi$[" • nmmmm ; "sa^p-t' - awr-i 
)mm ° ) 

«mm» ^ «M$-»mmn - mm"m"^mme.mmm'immn ° - «mm» 
'm^m"mx-mm" ^ "mAmm"i:^m - iiw'vj«tr c«28^) - -j^A^H-cHSTt;^) 

2.3.4 [Oi f + Vt] 



T 
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Oh, Grand-master Yin, you should be the base of Chou; you should hold the 
ordering of the state; the four regions, them you should unite; the Son of Heaven, 
him you should (augment:) strengthen; you should cause the people not to go 
astray. . (Karlgren, The Book of Odes , p. 1 33) 



M ^. four regions, them (you should) unite 
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kha-rtsan yonspai mi de Itas byun 




these four regions, you should unite; this Son of Heaven, you should strengthen 



2.3.5 [Pr f CVl mmi^ 

[SO\mj¥^m'M ' -iS::g:/M5](CV • coverb)^^® « il^GreenbergS^JUniversaJ 4 : 
With overwhelmingly greater than chances frequency, languages with normal SOV 
order are postpostional. 

"( VJj 1958 : 358-360) ' M^^iiM 1981 : 80) : 

^m^M^ («/r:»BS15¥) 
imZl;X " («fc»BS18^1-:) 



m ■■ 
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tB ' "W ^ "M"ikw<mm0m^-t} m% ■ 361-362) : 

MW"M"^ftfieiJ^iJ(i^li$( 1987 : 26) : 
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M ° 



^^mmmmmtmmh ^ sov • nm^m.m^±-^mm^mm ^ sommm ° 
st(2) ^ (3) ^ (4) ^ {bmm^mbmmm^mm^m • affWi^Aiayiiifi^^s^t 

i5{^tj«mm^Wi»±{i%i) ' 27-291:) ° m-c^mm"^^ ■ 
mm ^ mmmm ^ mmmm ^ mmm ^ mmm ^ nm ° 
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jI@§?£^^PS5^«rlin^^»C-t) ° A. meillet and M. Cohen, Les langues du monde , 
1st ed. {\924M^miW^^ ° 



' ^fflfti^i^i^^i^Ctonegenesis) ° ^^9c±^±-^'^Mm ' ^nJtg;^^^ 
Benedict)l5^Austro-Thai hypothesisdfS— ^fif^)!!^!^:!! ' (^i§I^Ma'^ESii*f^Pit 
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Slg tuYuI ' tuksan > Evenki tukucin "/Jn^" . Lamut tu- ~ tuyu- "^/Jn^" « 

^f.ii:^ ®^' ir^Maff1^7fi;^tt » 5g)tlE^:( Chang Kwang-chih, Archaeology of 
Ancient China (4th edition), p.234 fF)|g^if5 Wft ' -a:S'J^7eitij5000^f^ - ^^Mi^ 
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' S4@i^# • Jf^J^^-a:^i)fLS (Chinese Interaction Sphere) - TKII^?fi#235MS^J 

wwd nnwim mm 



ERIC 



-63- 



74 




197. Expansion of rcgionil Neolithic cultures in 
China from 7000 B.C. {Itfi) to 5000 B.C. {tight) 
and 4000/5000 B.C. (btlow). 



Source t Chang Kwang-chih, Archaeology of Ancient China (Uth ed., I986), p. 235 
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m^mmmnmmmm^m • umm^mmmrmmit • %nmi^ • ^^^^ " 
^ ^ mm^^mmmii^ • ii^ N-Aia]i7-&<]iiM^ « mmumm^mm • -^$3iW^g##6^ 

■m ' mtmmmm^ • m^mmm^ ° (4)i5]jf t^^sov^i^svo « fs^iis^'ftft^tg 

Dravidianig-ti " Gerard Difflothlg^ Munda nj fg g H * ^- 6^ SVO ^ 1^ ft W 
SOVC'Austro-Asiatic Languages", in Encyclopaedia Britannica (1975), 2:483e) <> ^MMt ' 

t^&MB#^Pi^ig^t:b^(crosslinguistic comparison)a^lil£5t5§ • JIftrIi#g{iH^{t 
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1935 : m^'&m'W ^ "^"> • «mm7ti-^%±^+^m-Xn» • 967 -996 ° 

1957 : «rli§5^^»(±) 

1958 : 

1982 : «nM'^^» 
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SOME REMARKS ON WORD ORDER 
AND WORD ORDER CHANGE IN PRE-ARCHAIC CHINESE 

Alain Peyraube 
CRLAO, CNRS, Paris & 
Hong Kong Baptist College 



ABSTRACT 



Since the 1970s, following the work of Greenberg (1966) on 
language universals, the problem of word order and word order 
change in Chinese has been much debated. The discussions have been 
essentially surrounding the hypothesis of Li & Thompson (1974) 
according to which Pre-Archaic Chinese (12th century B.C.) was a 
SOV language, which might have changed to a SVO one between the 
10th and 3rd centuries B.C. , before shifting back to SOV again, the 
last stage being still in action: a) SOV > b) SVO > c) SOV. 

The hypothesis SVO > SOV has been much criticized, especially 
by those (Light 1979, Sun & Givon 1985) who try to show that, 
synchronically , Chinese is and renains a SVO language and that the 
OV order is a marked [+ contrast iveness] order. However, very few 
scholars have challenged the first: change a) SOV > SVO in Archaic 
Chinese, because a SOV order in Pre-Archaic Chinese seems more 
plausible, insofar as this order is sometimes found in Classical 
Chinese (Early or Late Archaic Chinese) for pronominal objects 
under special conditions (especially when these pronouns are 
interrogative or in negative sentences) , or even for full lexical 
NP, when they are followed by pretransitive markers shi or zhi . 

It has therefore been supposed that these phenomena - even if 
they are minor - are relics of an ancient stage of the language, 
and that the regular word order must have been SOV. This opinion 
appears all the more to be probable to some that the language of 
the oracle bone inscriptions seems to reveal many more preverbal 
objects or preverbal PP than during the following stages, i.e. 
Early and Late Archaic. From here, some have perhaps too hastily 
concluded that Proto-Chinese must have been SOV and, therefore; 
Pro* r>-Sino-Tibetan also. Indeed, all the Tibeto-Burman languages 
(ex( ept Bai and Karen) have verb-final order. 

This paper will show that the hypothesis of a) (SOV > SVO) 
change is not empirically motivated. Any meticulous analysis of the 
language of the oracle-bone inscriptions, as the one which has just 
been completed by Shen Pei (1992), does not allow the conclusion 
that Chinese was more SOV in Pre-Archaic than in Early or Late 
Archaic. The jiaguwen language shows a regular order of SVO. To 
suppose that in a more ancient stage than the one we know today the 
regular order could have been SOV is, under these conditions, of 
pure surmise. 
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SOME REMARKS ON WORD ORDER 
AND WORD ORDER CHANGE IN PRE-ARCHAIC CHINESE 

Alain Peyraube 
CRLAQ, CNRS, Paris & 
Hong Kong Baptist College 

0 . Introduction 

0.1. Since the 1970s, following the work of Greenberg (1966) 
on language universals, the problem of word order and word order 
change in Chinese has been much debated. The discussions have been 
essentially surrounding the hypothesis of Li & Thompson (1974) 
according to which Pre-Archaic Chinese (12th century B.C.) ^ was 
a SOV language, which might have changed to a SVO one between the 
10th and 3rd centuries B.C., before shifting back to SOV again, the 
last stage being still in action: 
(1) a) SOV > b) SVO > c) SOV 

0.2. The hypothesis of changing from b) to c) relies 
essentially on the following facts: PP, which were usually post- 
verbal in Classical Chinese, have become mostly pre-verbal today, 
and the fea construction (where fea is a pre-verbal direct object 
marker), non-existent in Archaic and Pre-Medieval Chinese, is 
increasingly in use in Contemporary Chinese. The path of change, 
purely internal, might have been the following: the serial-verb 
structure "S+V1+0+V2" might have changed into a single-verb 
structure "S+Prep. (or marker) +0+V" after a classical process of 
grammaticalization changing the ve^b into a preposition or a 
marker . ^ 

This hypothesis has been much criticized, especially by those 
(Light 1979, Sun & Givon 1985) who try to show that, 
synchronically , Chinese is and remains a SVO language and that the 
OV order is a marked [+ contrastiveness ] order. ^ 

0.3. However, very few scholars have challenged the first 
change a) SOV > SVO in Archaic Chinese, because a SOV order in Pre- 
Archaic Chinese seems more plausible, insofar as this order is 
sometimes found in Classical Chinese (Early or Late Archaic 
Chinese) for pronominal objects under special conditions 
(especially when these pronouns are interrogative or in negative 
sentences), or even for full lexical NP, when they are followed by 
pretransitive markers shi ^ or zhi ^ . * We even find objects 
of prepositions before some prepositions. ^ 

It has therefore been supposed that these phenomena - even if 
they are minor - are relics of an ancient stage of the language, 
and that the regular word order must have been SOV. This opinion 
appears all the more to be probable to some that the language of 



ERIC 



•70- 



SI 



the oracle bone inscriptions seems to reveal many more preverbal 
objects or preverbal PP than during the following stages, i.e. 
Early and Late Archaic. From here, some have perhaps too hastily 
concluded that Proto-Chinese must have been SOV and, therefore, 
Proto-Sino-Tibetan also. Indeed, all the Tibeto-Burman languages 
(except Bai and Karen) have verb-final order. * 

0. 4. I will show that this hypothesis of a) (SOV > SVO) change 
- which was probably first suggested by Wang Li, who was comparing 
Archaic Chinese, with its preverbal pronominal objects, with French 

is not empirically motivated. Any meticulous analysis of the 
language of the oracle-bone inscriptions, as the one which has just 
been completed by Shen Pei (1992), does not allow the conclusion 
that Chinese was more SOV in Pre-Archaic than in Early or Late 
Archaic. The jiaguwen language shows a regular order of SVO. To 
suppose that in a more ancient stage than the one that these first 
documents are available to us, that the regular order could have 
been SOV is, under these conditions, of pure surmise. 

In the following account, I will make a distinction between 
object order and PP order, i.e. between the sequence SOV vs SVO and 
the sequence S PP V vs S V PP. 

1 . The place of the objects 

I will distinguish between pronominal objects from full 
lexical NP objects. 

1.1. Full lexical NP-objects 

When the object is a full lexical NP, the basic order is 
undoubtedly SVO. Everybody will agree with this statement. Having 
said this, one has to admit that there are situations which need 
more discussion. One also finds in jiaguwen examples of the type 

(1) < ^ I?] J T 

san bai qiang yong yu Ding 

three hundred Qiang use-f or-sacrif ice to Ding 

Three hundred Qiang have been used for sacrifice to Ding. 

One would see in such sentences a (S)OV order. Indeed, the 
current analysis (Guan Xiechu 1953, Chen Mengjia 1956, Chen 
Chusheng 1991) is the following : DO (Direct Object) + Verb + 
Preposition + 10 (Indirect Object). Therefore, there should also be 
in the language of jiaguwen an OV structure - certainly 
exceptional, but non the less existent - co-occurrent with the 
basic (S)VO order. 

Tang Yuming (1990) and Shen Pei (1992) are the only ones who 
talk about subject-patient sentences ( shoushi zhuyu iu ) ( in other 
words, the pre-verbal NP should not be an object but a subject- 
patient) existent in Chinese, of all stages (archaic, medieval, 
contemporary). ' I think they are right. As emphasized by Shen Pei, 
these sentences have effectively the following characteristics : 
(i) if a subject-agent is present, it is placed between the patient 
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and the VP ; (ii) the negative form of such sentences is : "patient 
+ negation + verb". These characteristics differentiate them from 
preverbal noun-object sentences, where the subject is in initial 
position and where the adverb of negation is in front of the noun- 
cbject . 

Sentences (1) (2) are therefore not of pre-verbal object , but 
of subject-patient. 

1.2. Full lexical NP-objects introduced by hui and wei 

There also exists in the language of the oracle-bone 
inscriptions a "(S) + hui ^ (or ml A%. ) + O + V" form for 
affirmative cases and "(S) + Negation + wei + o + V" for negative 
cases. And these particles hui and wej have been interpreted as 
pre-verbal markers, like U^yL in Classical Chinese, or ba 
/ j i^ng ^fj- in Medieval and Contemporary Chinese (Guan Xiechu 
1953, among others). Examples: 

(2) ^. ¥ \^ M <^ i^t-- 
hui hei yang yong you da yu ^ 

hui black sheep utilize-for-sacrif ice there-is big rain 
(If we) utilize a black sheep for sacrifice, there would 
be abundant rain. , , u " v 

(3) fo ^ X. ^ ^-i^ Cfe i)li^f*)i^^ 0^^') 

wang wu wei Long fang fa 

king negation wej Long tribe fight 

The king will not fight the Long tribe. 
Such an analysis is problematic, other than the fact that 
there is no reason for analyzing san bai qiang as an subject- 
patient in (1) and hei yang as an object in (2) (they are indeed of 
the same type of sentences, no matter whether hui is present or 
not), it is clear today that hui and wei are not simple markers of 
pre-verbal objects, as one may think, but focalizers serving to 
stress the constituent they follow, this constituent being an 
object, or a subject, or even an adverbial. ' 

(4) ^ ftf- ^ iji^ (Djamouri 1988: 245) 
wei Di znao wang 31 

wei God provoke king illness 

It is God who has provoked the illness in the king. ^ 
In this last sentence, it is the subject which is focalized. 
That imi and wei could thus introduce constituents other than the 
objects, proves that hui and wei are not simple markers of pre- 
transitive objects. 

Thus, sentences (2) and (3) should be understood as: "(if) it 
is a black sheep which is utilized, there would be abundant rain" 
and "It is not the Long tribe that the king has to fight". In (2) 
imi focalizes the subject-patient hei yang and in (3) wu wei 
focalizes the object Long fana . Another example with an object 
focalized with hyi in an affirmative sentence: 

(5) 5, ^ ^ ^g. 6442) 
wang hui Tufang zheng 

king liui Tufang leave in expedition 

It is against the Tufang that the king leaves in 
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expedition. 

These sentences of pre-verbal object preceded by hui / wei are 
therefore not unmarked sentences, but marked by a f ocalization of 
the object. These are nonetheless sentences with pre-verbal 
objects. One therefore has to admit that the language of the 
oracular bone inscriptions, when it wants to focalize an NP-object, 
uses two devices : (i) put in front of the said object a marker of 
f ocalization ; (ii) move it, w:.th its marker, in pre-verbal 
position. In the absence of hui / wei , the order remains of course 
(S)VO. 

I would nonetheless like to emphasize here that the Pre- 
Archaic is not different here from the EAC or even from the LAC 
which uses markers of f ocalization shi ^ ou zM behind (and not 
in front of) the objects, before moving them in pre-verbal 
position. It is therefore unreasonable to think that the Pre- 
Archaic shows more (S)OV order than the Early/Late Archaic, at 
least in this problem of preverbal objects introduced by markers of 
focal ization. It seems that the contrary is the truth. 

Huang Dekuan (1988) remarks indeed that this " hui / wei +O+V" 
form is present in all periods of the oracle bone inscriptions, but 
he also notes that the examples are more numerous in the 
inscription3 of the last period; and that since the llth century 
B.C., WQi could replace hui in the affirmative sentences. ' 

The subsequent evolution miaht have been the following: in the 
bronze inscriptions (iith-8th c. B.C.), aei had almost entirely 
replaced faiii, then an intermediate form appeared: " wei +O+ shi 
+V", which immediately left its place to »»0+g}ii+V" (all these forms 
are attested in the Shana shu , I0th-8th c. B.C.), which itself 
would be later on replaced by "0+ zhi ^ +V". " 

No matter what the evolution was, one thing seems to be quite 
certain today: markers liui and stfei in Pre-Archaic were focalizers, 
contrastive markers. It is also the case in Early and Late Archaic 
Chinese for shi and alii. It is therefore not reasonable: a) to 
speak of a natural, unmarked SOV order while the object were 
preceded by such markers, b) to suppose that the Pre-Archaic was 
more sov than Early/Late Archaic, in that respect. 

1.3. Pronouns-objects 

What about pre-verbal pronouns which were not introduced by 
markers? The situation is the same. Pre-Archaic was not more SOV 
than Early/Late Archaic Chinese. 

We know that in Classical Chinese pronoun-objects are usually 
pre-verbal when they are in negative sentences, or when they are 
interrogative pronouns in interrogative sentences as well. Ex. : 

(6) i % fl^ C0 f : 

3 in yu wei bu er sha 

now I f or-the-moment negation you kill 

Now, for the moment, I won't kill you. ,« . 

wu shei qi ' bu wu zhi ye ' 

I who abuse negation I know particle 
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Whom I abi'.se? (He) does not know me. 

They become post-verbal, as the NP-objects already were, 
later, some time during the Han. 

For the JGW, Guan Xiechu (1953) considers that the pronoun- 
objects are pre-verbal in the same conditions as in Classical 
Chinese. Chen Mengjia (1956), on the other hand, thinks that the 
constraints are stricter : it is necessary a) that the negatives 

should be those in bu ; b) that the pronoun should be ^ ^ . 
11 

He indeed seems to be correct. Examples : 

(8) ^ % ^ % t ^% % : Wi) 

Di bu wo qi shou you ^ ' 

God negation we modal-particle give assistance 

God wiil not give us assistance. 
In this example, the pre-verbal pronoun-object is the indirect 
object of a double-object construction. More examples with single 
objects are: 

r? I- ^ A ''*^'*) 

Ui bu wo hah ^ 

God negation we dry-up 
God will not dry up us. 

(10) *i ^ t 1^%. 1?-^ 

Zu Xing bu wo hai 
Ancestor Xing negation we harm 
Ancestor Xing will not harm us. 
There are also some cases there the pronoun is not wo and 
where the negation is not bu, but these are extremely rare : 
(11) 

wu yu hai 

negation I harm 

(He/They) will not harm me. 
Shen Pei (1992: 23) has found 64 negatives with preverbal 
pronoun-object: among these, 57 use the negation bu and the pronoun 
wo, 4 the negation bu and the pronoun yu, 2 the negation wu and the 
pronoun yu, 1 finally the negation bu and the pronoun er. One thing 
is certain: the preverbal pronoun-objects are always personal 
pronouns, never demonstratives, just the same as in Classical 
Chinese. 

The first conclusion that one can draw is the following : 
there are indeed preverbal pronoun-objects in the language of the 
oracle bone inscriptions, but they are probably rarer than in 
Archaic Chinese; they are indeed only limited to personal pronouns 
(demonstrative pronouns are excluded from this structure) and they 
concern almost exclusively the one pronoun in the negatives with 
the sole negation by,. 

Moreover the inscriptions of the first period also show that 
these pre-veioal pronouns-objects, even in negative sentences, 
could also be postverbal. It is almost always the case when the 
negative is not fea, naturally, but also sometimes when the negative 
is bu . 

Finally, as Djamouri (1988: 462) points out, the negative bu 
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could be equivalent to bu wei . meaning "it is not". If this 
hypothesis is correct, the preverbal object-pronouns in the 
jiaauwen would have to be interpreted as always focalized and the 
sentences (8) to (10) be translated as : "It is not to us that God 
is going to give assistance" (8), "It is not us whom God will dry 
up" (9), "It is not us whom Ancestor Xing will harm" (10). 

1.4. Conclusion 

The situation of the Pre-Archaic language could therefore very 
well have been the following: (i) the regular, unmarked order was 
SVO; (ii) there was also an inverse order SOV, but this order was 
marked (the object was stressed, introduced by a marker of 
focalization hui or wei ) ; (iii) in the negative sentences with bu, 
however, when the object was the personal pronoun wo, the focalizer 
hui or wei was not necessary. From marked, these sentences became 
progressively unmarked, giving birth to an unmarked SOV order. 
Thus, pronouns could not have been conservative of an ancient 
order, as it is usually believed, but, on the contrary, initiators 
of a new order which, subsequently, had never been successful in 
imposing itself for the full-fledged lexical NP. " 

Now let us look at the FP order in the language of jiaguwgp» 

2. The place of the PPs 

One usually considers that the PP, mostly preverbal in 
contemporary Chinese, were postverbal in Classical Chinese. 
However, criticizing Li & Thompson (1974), many linguists (Huang 
1978, Li 1980, Sun 1991) have noted that PP are far from being all 
postverbal in Classical Chinese (Early/Late Archaic). Thus, for the 
two common prepositions yu 3* and yi WK , if the first is 
essentially postverbal (proportionately more than 90 % of the 
cases), the second is basically preverbal (according to Sun 1991, 
only 12 % of yi occur in a postverbal position) . " 

2.1. What then was the situation in the language of the oracle 
bone inscriptions? The situation is even more complex. Both Guan 
Xiechu (1953) and Chen Mengjia (1956) consider that the natural 
order is (S)+V+PP. Wang Li (1958: 368) on the contrary thinks that 
in this period the order was not well fixed and that the locative 
PP could be either preverbal or postverbal, while admitting that 
the postverbal cases are more frequent. 

Only the preposition yu is present in considerable frequency. 
" The yu-PP f as already noticed, are essentially postverbal, but 
the examples of preverbal yu-PP are not rare (many more than in 
Classical Chinese) , which could suggest that before the jiaguveP 
period, Chinese had perhaps been a language where the PP were 
preverbal (cf. Wei Pei-chuan, forthcoming). 

I will try to show that this induction is very questionable. 
Shen Pei (1990, 1992) shows convincingly that it is necessary, if 
one wants to see more clearly, to distinguish PP [+ time] from the 
other PP, PP [- time]. 
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2.2. The place of the yu-PP [- tine] 



The Yu-PP [- time] are mostly postverbal, as in : 
(12) i >{|^ ? it : 

wang wang yu chun ^ ^ 
king go to suburb 
The king went to the suburbs. 
However, sometimes, especially when the preposition yu is not 
the dynamic locative preposition meaning "to" (dao 5.1 in 
Contemporary Chinese) but the static locative preposition meaning 
"at" ^ in CC), or when yu is not a locative preposition but 

a dative one, the yu-PP can be either postverbal or preverbal 
Example of a preverbal yu-PP ; 



(13) f ? f t : ^^^^O 

yu Fu Jia sui 



to Father Jia ask-for 
It is to Father Jia that (we have to) ask for. 
It is better, however, to say, as the translation of (13) 
indicates, that the preverbal yu-PP are then marked; they are 
stressed, focalized. Cf. Chen Mengjia (1956), shen Pei (1992). 

Shen Pel cites several examples, in context, where it is 
obvious that the PP are focalized. He adds that it is probably 
difficult to show that ail the preverbal PP [- time] are focalized, 
but It IS impossible to show that they do not express emphasis. 

Thus, yu-PP [- time] are postverbal in unmarked utterances 
and preverbal while they are put into emphasis, i.e. in marked 
utterances . 

2.3. The place of the ya-PP [+ tixta] 

The yu-PP [+ time] have a different behavior. They can be 
either postverbal or preverbal, but the preverbal ones are more 
numerous. They then can be, of -:ourse, marked, stressed, or 
focalized; however, for most cases, they are unmarked, and not 
focalized. Ex. : 

(14) f 5§ Cife 

yu da X ri jiu ^ ^ 

at Da X day perf orm-wine-sacrlf ice 

Perform the wine sacrifice at the Da ? day. 

(15) yu ren wang qi tian f ft i 1 © <k ■ 2AZ^^\ 
at Ren king modal -particle hunt ^ ' ) 
At the Ren day, the king will go hunting. " 

This is a situation very different from the one that has iust 
been discussed for PP [- temps]. However, one would be wrong to 
draw the conclusion that the PP [+ time] might all be preverbal in 
a stage of language prior to the one represented by oracle-bone 
inscriptions. 

Indeed, as shen Pei (1990) has noticed, in the most ancient 
inscriptions, those of the first period, which can de dated at the 
Jatest to the last decades of the Wu Ding reign, the tinie PP in yu 
are postverbal, as in: ^ 
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jiu sheng sui yu gengyin 

wine-sacrifice elevatory-sacrif ice immolating-sacrif ice 
at Gengyin 

The wine, elevatory and imr^olating sacrifices (will be) 
at Gengyin day. 

After having remarked that nouns of time, in these ancient 
inscriptions are also sentence final or sentence medial, Shen Pei 
naturally concludes that there was an order change in the language 
of oracle-bone inscriptions, from "V+PP" to "PP+V" when the PP were 
time PP, and that the other PP remained postverbal. In other words, 
the order of the PP, in the most ancient stage we know, was indeed 
"V+PP" . 

2.4. Cone lus ion 

PP were originally all postverbal. Those of time were the 
first to change. They became preverbal, since the time of the 
oracle bone inscriptions. The other PP [- time] remained mostly 
postverbal, at least in unmarked utterances. Preverbal PP [- 
time], exist naturally, but they express a focalisation, they are 
marked. Preverbal PP [- time] not expressing focalisation, i.e. 
those which are not marked, are dated after the j iaguwen . 

3 . Conclusion 

3.1. In conclusion, a meticulous study of the i iaguwen 
language does not allow to assert that SOV or S+PP+V preverbal 
orders are found in greater numbers than in stages posterior to the 
language of the oracle bone inscriptions. Early or Late Archaic 
Chinese. Therefore, there is no evidence to justify the hypothesis 
of SOV > SVO or S+PP+V > S+V+PP changes in Archaic Chinese. Such a 
hypothesis has been formulated by many linguists to explain certain 
SOV orders (particularly in the case where the objects are 
interrogative pronouns, or simple pronouns in the negatives) or 
S+PP+V orders (especially when the PP are introduced by yi) . 

3.2. If there were changes, these were rather changes of an 
opposite nature which had taken place, that is SVO > SOV or S+V+PP 
> S+PP+V. Indeed, the first of these changes concerned personal 
pronoun objects, which were effectively postverbal, like all the 
other objects, but which became preverbal in the negatives. The 
second change concerned the PP [+ time], which were also postverbal 
as the other PP, but which had become preverbal. 

These changes were never imposed to marginalize the old 
orders. 

3.3. One cannot, of course, make any final conclusion from 
here that Proto-Chinese was SVO and not SOV. The jiaqxawen language 
does not represent Proto-Chinese. It is not impossible that in 
stages prior to j iaguwen , which are unknown to us, the language 
shows a SOV order. One can make such a hypothesis by arguing that 
several marked and marginal orders, in the j ia guwen language. 
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reflect such an order. We know that the irregularities are often 
relics from the past and that, as said Meillet, "la grammaire 
comparee doit se faire en utilisant les anomalies bien plus que les 
formes regulieres". 

• What we still have to deal with is that if the exceptional SOV 
and S+PP+V order of iiaguwen were relics of an ancient general 
SOV, it is not likely that these "relics" were marked utterances. 
The marked utterances, in the history of a language, are indeed 
new. It is unlikely also that these marked utterances, in iiagrwen , 
if they were really relics from an ancient order, could be found in 
lesser quantities than in the periods following the Pre-Archaic, 
i.e. Early and Late Archaic Chinese. 

3.4. Whatever the order of Proto-Chinese , one thing is sure. 
The iiaguwen language does not give the least indication which 
could justify in any way that Proto-Chinese might have been SOV. 
The contrary is more likely. 



Footnotes 



1. What I mean by Pre-Archaic Chinese is the language of the 
oracle-bone inscriptions f jiaguwen ) , I4th-llth centuries B.C. The 
other periods are : Early Archaic Chinese, I0th~6th c. B.C., Late 
Archaic Chinese, 5th-2nd c. B.C., Pre-Medieval Chinese, 1st c. B.C. 
- 1st c. A.D. For the justification of this periodization , see 
Peyraube (i.988a). 

2. Such cases of grammaticalization have been analyzed by Peyraube 
(1988b) . 

3 . For a good review of this long-winded debate on word order and 
word order change in which historical syntax was formerly 
entangled, see Qu Chengxi (1984) where the arguments raised by Li 
& Thompson (1974, 1975) Tai (1973, 1976) in favor of a change SVO 
> SOV are discussed in detail, as well as the arguments given by 
Light (1979), Mei Kwang (1980) against such a change. See also 
Huang Shuan-fan (1978), Li Mengchen (1980), Travis (1983), 
Hashimoto (1984), Sun & Given (1985) and Wang Mingquan (1988). 

4. See Yu Min (1981), Yang Bojun (1982), Yin Guoguang (1985). 

5. Sun Chaofe:i (1991) has found cases where the object of 
prepositions yu :J and especially yl l^/^ was sometimes pre- 
prepositional , without the necessity of any marker, or of the 
object being a pronoun. In other words, the "prepositions" are in 
fact postpositions. 

6. See LaPolla (1990, chapter 5; 1993), Sun Chaofen (1991). 
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7. For the notion of subject-patient, put forward by Zhu Dexi , see 
Lu Jianming (1986). The only difference between the jiaguwen 
language and that of the posterior periods is the following: in 
Ancient Chinese the VP is rarely a simple verb while it is the case 
in tha oracle-bone inscriptions. 

8. See Djamouri (1988), Huana Dekuan (1988), Zhang Yujin (1988) and 
Shen Pei ( 1992) . 

9. See also Huang Tianshu (1991). 

10. See Yin Guoguang (1985). 

11. In fact, in Classical Chinese, things are not as simple. Zhou 
Guangwu (1959) has done an exhaustive study on the pronoun-objects 
pre-verbal in the negatives of several works of Early/Late Archaic 
Chinese, and has concluded that the situation is relatively 
complex. It depends on the nature of the pronouns and of that of 
the adverbs of negation. He also agrees with the two constraints 
raised by Chen Mengjia (1956). 

12. Djamouri (1988) has also noted that the object pronouns are 
rarer than subject pronouns. He quotes the following figures : 97 
object pronouns wo for 315 wo subjects, 6 objects yu for 152 yu 
subjects. 

13. It is probably true that in Romance languages the SVO to SOV 
order change affected full NPs before object clitics, but this is 
far from being a general phenomenon. Steele (1977) argues against 
the conservative nature of clitic pronouns on the basis of 
reconstructed c^anges in Uto-Aztecan. Similarly, in Modern Greek, 
we have postverbal full NP objects and preverbal clitic objects, 
but these citic objects arose long after Greek had undergone its 
SOV to SVO change. See Lightfoot (1979: 152). 

14. These figures should be taken with precaution insofar as they 
were established from one single chapter from Menqzi and one single 
chapter from Zuo zhuan . See also Peyraube (1988a) who finds, for 
the same period (Late Archaic), that 70 % of yi-PP are preverbal 
against 30 % postverbal, but only in double-object constructions. 

15. Other prepositions are zi ^ and perhaps zai and epog ' 
but they are less frequent. I will discuss here only the place of 
the yu-PPs . 

16. In this last example, the subject is between the PP [+ time] 
and the VP, but usually the subject appears in the initial 
position . 

17. See LaPolla (1992). 
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Formosan clause structure: transitivity, ergativity, and case marking 

Stanley Starosta 
University of Heidelberg and University of Hawai'i 

1. Prologue 

In the beginning was Greenberg. And Greenberg saw that the tongues of the earth 
were many, and that the generative descriptions of them were few. And Greenberg 
said, "Let us go out into the fields and into the libraries, even into the dim studies and 
into the musty sandalboxes full laden with dataslips that lie therein, and let us make 
sweet-smelling order from amongst the unruly data collections that rest upon the face 
of the earth, and let us make graven generalizations in accordance with the 
configurations that are made manifest by our labors." 

And that is how modem linguistic typological research began. 

2. Function-based typology 

What Greenberg was up against was vast amounts of observations made from the point of 
view of utteriy inconsistent descriptive frameworks, or from the point of view of the observers' 
unstated and frequently unrealized preconceptions. It would have been nice if he had had at his 
disposal a thousand complete and explicit descriptions of typologically diverse languages, all stated 
within the same formal and constrained and proven grammatical framework by trained and 
competent and experienced researchers, but he didn't. He was faced with the choice of either 
waiting for the linguistic millenium or doing something with what he had available. 

His decision, as we all know, was to go ahead. Since he didn't have formally consistent 
descriptions to work with, he had to create workable categories that he could superimpose on the 
data and use to extract generalizations from it, and the categories he chose, not surprisingly, were 
rather subjective and intuitive semantic ones. As William Croft notes, 

'We are attempting to determine the universal properties of relative clauses (RCs) by 
comparing their syntactic form in a large number of languages. To do this it is necessary to 
have a largely syntax-free way of identifying RCs in an arbitrary language. Our solution to 
this problem is to use an essentially semantically based definition of RC. (Keenan and 
Comrie 1977:63). ' (Croft 1990:12) 

Since Greenberg was, I assume, a native speaker of English with an education in the American 
school system, the intuitions he drew on in creating his categories were anglocentric ones, and 
since most or all of the researchers who applied them were also speakers of English, it isn't 
surprising that the categories seemed natural and intuitive to them too, and that they were able to 
apply them with a good degree of consistency. 

I think that this was great as a temporary expedient. It was nice to have categories like 
Greenberg's 'subject' and Comrie's and Dixon's S, A, and P/0 and Keenan and Comrie's 'case 
hierarchy' to facilitate discussions of data among linguists who didn't share a common theoretical 
conceptual grid. However, that should have been just an intermediate step, pending the 
development of the consistent and explicit and constrained frame of reference which would make 
possible a more rigorous and theoretically well founded comparison. As the results started coming 
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in, the original categories should have been revised to compensate for the original English bias, and 
a set of empirically founded natural categories should have evolved which could have served as one 
of the pillars of a realistic universal theory of language. 

However, this was not to be. Instead, in the vacuum created by the theory wars of the 60's 
and 70's, this anglocentric and unformalized 'functional' approach to language comparison took on 
a life of its own. Categories like 'subject', S, A, and P, originally created as situational roles for 
establishing functional correspondences among the diverse linguistic structures that encoded these 
roles, became increasingly regarded as syntactic constructs in some assumed universal but never 
formalized theory of language. 

The consequence of this approach in the area of syntax at least has been a typological 
framework that regards all languages in effect as more or less radical deviations from the prototype 
language, English. The most striking example of this has been the treatment of ergative languages. 
Of the exceptions to various universals proposed in the anglocentric functional approaches to 
typology, a statistically significant number of them can be placed at the doorstep of ergativity. 
Ergative languages have been knocking with increasing insistence on the door of linguistic theory, 
and I think that it is fitting that here in Taiwan, the homeland of one of the biggest and most 
ergative language families in the worid, the Austronesian family, we should open that door, 
welcome them in, and draw the benefits from the precious gifts they bring with them. 

3. Problems with the functional approach to syntactic typology 

What's wrong with S, A, and P? Two things: (1) pseudotransitives and (2) disjunctions. 
First of all, we here are all familiar with the pattern that is often presented to syntax students in 
describing the difference between ergative and accusative languages: 

Figure 1) 

Accusative | Ergative 

case marking | case marking 




Nominative Accusative i Ergative Absolutive 



So what's wrong with that? The answer is in the part that's missing. . An accusative 
language groups Ss ('intransitive subjects') together with As ('transitive subjects'), it is said, while 
an ergative language groups Ss ('intransitive subjects') together with Os ('transitive objects') 
(Dixon 1979:59). The identification of S is fairiy though not completely straightforward: if we can 
identify a large class of sentences which contain only one MP, we are fairly safe in calling that ' S'. 
But what about A and P? If we can identify a class of transitive sentences, then picking out the A 
and the P will not be so hard, but as Hopper and Thompson have shown us so vividly, semantic 
transitivity and syntactic transitivity, though linked together in intricate ways, are not the same 
thing. Semantic transitivity can be established in accordance with some intuitive and rather 
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subjective guidelines, but the identification of syntactic transitivity requires a careful language- 
specific syntactic analysis. 

It should be obvious that a typologist surveying a large number of languages does not have 
time to do a careful language-specific syntactic analysis of each one, and so in applying this 
criterion for identifying ergative languages, he or she will almost always grab for semantic 
transitivity rather than syntactic transitivity, frequently I think without even being aware that there 
is such a difference. The consequence is that the resulting identification of a language (or its case 
marking system, to be more specific), as ergative or accusative has only marginal syntactic 
significance. 

This kind of shoot-from-the-hip semantically based determination of ergativity has, as 
Jeanne Gibson and I have shown (Gibson and Starosta 1990), had bad consequences in the analysis 
of Polynesian languages, and it is starting to have bad consequences in the analysis of Formosan 
ones, for the same reason: these languages typically have more than one pattern for translating 
English transitive sentences, so both will count as 'transitive' for the typologer in the street, and the 
result will be a determination that the languages are accusative or possibly, in Tang Ting-chi's 
terms, 'focusing languages'. However, when syntactic and morphological considerations are taken 
into account, and when we look at the semantics more carefully in the terms used by Hopper and 
Thompson, it turns out that only one of the patterns is syntactically (' canonically') transitive and 
that the other is not. When only the canonical grammatically transitive sentences are used in the 
determination of ergativity, the result comes up quite clearly ergative for the languages I have 
looked at in this way. The other two-argument pattern, although possessing both an 'A' and a 'P' 
in the conventional typological usage, is grammatically and semantically intransitive. I will refer to 
this latter pattern for the remainder of this paper as 'pseudotransitive'. 

The second fundamental problem with an S, A, and P analysis is disjunctions. In reading 
older work by Dixon and Comrie and recent work by Croft, it is striking how often cross-linguistic 
generalizations are stated in terms of 'S and P' or 'S and A'. Grammarians refer to this kind of 
term showing up in a rule as a disjunction, and it is a fairly reliable sign that there is a problem with 
the analysis, and that a generalization is probably being missed. One of the advantages of the 
alternative case-marking system I will present below is that it can describe many of the same 
phenomena more generally, that is, without the need for disjunctions. 

What's wrong with the category 'subject' in typological work? There seems to be 
agreement that something is wrong with it, because several prominent linguists working within this 
tradition have suggested that maybe we should dispense with it altogether. As Schachter found out 
when looking at the category of subject in Taglog (Schachter 1976), 'subject properties' seem to 
split up into two groups, which are treated quite differently in Tagalog syntax. The conclusion he 
should have drawn from this, though as I recall he didn't, is that it was a mistake to set up a group 
of 'subject properties' on the basis of English grammatical subjects in the first place. That is, the 
problem is not that Tagalog splits apart a unity, but rather that English, as an accusative language, 
links together two distinct primitive categories. Nominative and actor, and that ergative languages 
such as Tagalog keep them apart. I will try to show below, based on data from Formosan 
languages, that keeping these two categories apart in grammatical theory makes it possible to 
capture some nice language-specific and cross-linguistic generalizations 

There's something else wrong with the conventional ergative-absolutive analysis, in addition 
to using semantic rather than grammatical transitivity in defining it. That is the use of the term 
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'Absolutive' as the name of the case that marks the S and P (continuing for the time being to use 
these functional labels). When we compare the Absolutive category across languages, we find 
some typical properties that generalize very gratifyingly: Absolutive Ns are typically 
morphologically simpler than non-Absolutive Ns; Absolutive MPs are typically the least omissible in 
context, and if there is only one NP that agrees with the predicate, it is the Absolutive one. It is 
very nice to see these patterns emerging over and over when analyzing a new ergative language, 
but a bit disquieting if we look over our shoulders and notice that the guys over in the accusative 
workshop have found a set of NPs that have exactly the same properties, but that they are calling 
them 'Nominative' or '(grammatical) subject'. 

So if these two categories have the same properties across languages, why should we give 
them a different name? The answer is twofold: (1) tradition, and (2) S, A, and P. European 
linguists who encountered ergative languages for the first time didn't want to call the unmarked 
NPs 'Nominative', in spite of all their similarities, because they marked the 'direct object' of a 
transitive sentence rather than the 'subject'. In more modem typological terms, if one case form 
marks S+A, as it does in accusative languages, and if we have already decided that S+A is a 
'subject', then we may come to think of 'Nominative' as the case form that marks the 'subject'. If a 
case form in another language doesn't mark a 'semantic/deep subject' (S+A), however, how can we 
call it the subject case, even if we lose significant cross-linguistic generalizations by not granting it 
the same name? 

For a linguist, generalizations should be sacred, and technical terms are hypotheses about 
natural categories that shoud be maintained only as long as they don't get in the way of capturing 
generalizations. The assumption that Nominative marks S+A is the one that is causing the 
problem, and must be dropped if we want to solve the problem. Nominative and Absolutive should 
have the same name (I will call them both 'Nominative'), and should be defined as whatever case 
form marks S and either P or A. We can then also redefine 'subject' in a useful way as the 
combination of Nominative and either S, P, or A. 

One of the places where we could use a good definition of 'subject' is in the statement of 
the Keenadn and Comrie NP accessibility hierarchy: 

'The general pattern that [Keenan and Comrie] discovered for a large class of relative- 
clause types can be described as follows: 

NP Accessibility Hierarchy 

subject<direct object<indirect object <oblique' (Croft 1990: 108-109) 

'Compare the claim in Keenan and Comrie 1977 that reative clause formation, and possibly 
many other syntactic processes, apply most readily to subjects, transitive or intransitive.' 
(Comrie 1978:391) 

So what's wrong with that? Before I tell you, I should say that I have the greatest admiration for 
the NP accessibility hierarchy. It makes some real and concrete broad-ranging generalizations 
anout a let of languages. What it doesn't do, I have found over the years, is give the right answers 
for ergative Austronesian languages like those of Polynesia, the Philippines, and Taiwan. If we 
regard these languages as accusative, or use Tang's 'focusing' characterization, then the NP 
accessibility hierarchy describes relative clauses in these languages perfectly. Thus for example 
Elizabeth Zeitoun (Zeitoun 1992b: 33) rejects an ergative analysis for Tsou in favor of Tang's 
'focusing' analysis for Austronesian languages, and one advantage of this decision is that the NP 
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accessibility hierarchy would then fit Tsou relative clauses quite nicely. That is, by her analysis 
(Zeitoun 1992b:3), Tsou does have nominative case, and she refers to the nominative-marked NP 
as the 'subject', in accordance with Tang's 'focusing' analysis. An examination of other data shows 
that it would thus fit the NP accessibility hierarchy for relative clauses. However, what if she were 
to look over at Arlene Ho's MA thesis on Yami (Ho 1990), Lillian Huang's recent work on Atayal 
(Huang MS (1993)), or at the somewhat earlier work on ergativity on Philippine languages in the 
GB, RG, and lexicase frameworks, and decide that Tsou is ergative after all? What then would 
happen to the generalizations she could have made about Tsou relative clauses? Does Tsou 
suddenly become a counterexample to the NP accessibility hierarchy instead of one of its empirical 
supports? 

The problem is that while Austronesian ergative languages typically (exclusively?) relativize 
only on one argument, the 'Absolutive', the Absolutive is not the 'subject' in Keenan and Comrie's 
anglocentric view, and therefore has no place in the hierarchy at all. 

'The NP accessibility hierarchy is subject to numerous qualifications and has some 
exceptions.... We will simply note here that, not stirprisirigly [italics mine], accessibility to 
reiativization is based on the ergative/absolutive distinction in some languages, and in those 
languages the absolutive role is the least-marked role (i.e. the only one accessible to 
reiativization).' (Croft 1990:110) 

As a consequence, ergative languages require a set of separate but equal generalizations about 
relative clauses etc. For example, in addition to a statement that passive must apply in an 
accusative language in order to move an argument into the subject position in order to make it 
eligible for some process to apply, there will be parallel statements for ergative languages of the 
following type: 

'In order to make the A argument accessible to relative-clause formation, the verb must be 
antipassivized, thus placing the A argument in the absolutive case (and structurally marking 
the construction).' (Croft 1990:1 1 1) 

If we try to improve this state of affairs, though, by recognizing that the Absolutive is the 
same thing as the Nominative, and replace the anglocentric term 'subject' by '(case-role-bearing) 
Nominative actant', then the most essential term of the NP accessibility hierarchy 
(Nominative<non-Nominative) fits ergative languages just as well as it fits accusative ones. They 
aren't exceptional any more, and Tsou relative clauses are well-behaved once more. 

4. The ergative analysis 

I have just outlined the reasons for replacing a functionally based typological framework by 
a grammatically founded one. However, I realize full well that this is not going to satisfy linguists 
accustomed to dealing with lots of real data from lots of languages. Thus what I propose to do for 
the remainder of this paper is to present a case-marking system which is based on the analysis of 
parts of 76 different languages from thirteen language families or areas within a single formal, 
explicit, and constrained grammatical framework, lexicase dependency grammar. I will then 
attempt to show how each of the essential primitives of the system is empirically supported by 
showing how it makes possible the capture of morphological, syntactic, and semantic 
generalizations about Formosan languages. 
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5. Case marking and ergativity 

Xexicase' is a generative (formal and explicit) and constrained version of dependency 
grammar. The diagram below presents the lexicase counterpart of the conventional ergativity- 
versus-accusativity diagram given as Figure 1): 

Figure 2): Accusative versus ergative case marking (lexicase) 

Accusative: actroNom Ergative: PAToNom 



[-trns] 



[+trns] 




In this diagram, [±TRNS] represents the distinction be tween grammatically (not 
semantically) transitive versus grammatically intransitive verbs. Every verb in every language is 
marked positively or negatively for this feature, pat (patient) and agt (agent) are g rammatical 
case relations, not subjectively identified situational roles. Every verb takes a PAT as a dependent, 
and every transitive verb takes an AGT. (There are only three other case relations, LOC (locus), 
COR (correspondent), and MNS (means).) ACTR (actor) is a macrorole' (a term taken from Role 
and Reference grammar) which matches the AGT of a transitive verb and the PAT of an 
intransitive verb. (Fom now on, 1 will always take ^transitive' to mean ^grammatically transitive' 
unless otherwise indicated.) NOM is the nominative case form. It refers to any morphological 
and/or syntactic configuration which is common to both the single argument of a simple intransitive 
verb and either the PAT or AGT of a grammatically transitive verb. 

As mentioned earlier, Nom is prototypically the the least marked case form in terms of 
morphology; the Nom actant is the least dispensable nominal constituent in a sentence, and if there 
is agreement between a predicate and any argument, there is agreement at least between the 
predicate and the Nom actant. Acc (accusative). Erg (ergative). Gen (genitive), and Ins 
(instrumental) are also case forms. Given these basic categories, it is easy to state a definition of 
ergativity as opposed to accusativity: an ergative case-marking system is one in which Nom marks 
PAT, and an accusative case-marking system is one in which Nom marks actr. (Note that I am not 
excluding the possibly that both systems may coexist in a single language, as they do of course in 
Dyirbal for example.) Nom can mark only PAT or AGT, so that there is no such thing in this 
system as e.g. Tang's ^goal subject' (Zeitoun 1992b:7) In an accusative pattern, PAT is always 
marked by Acc, but in an ergative pattern, AGT may be marked by the Gen case form, as in 
Formosan and Philippine languages. Ins as in Tibetan, or by a special Erg case form, as in Dyirbal 
and Hindi. 

Case relations in a lexicase grammar are perceptual rather than situational. A language may 
encode the participant in a given situation in more than one way, and assign it different case 
relations accordingly. Deciding whioh grammatical case relation an NP bears is thus a grammatical 
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matter, and cannot be settled by looking out the window and seeing who starts off with the 
baseball and who ends up with it, since a given language may allow such a situation to be encoded 
in more than one way. Assigning a given case relation to a given NP is not an a priori situationally 
based choice but an empirical hypothesis, testable on the basis of which assignment results in the 
best set of language-internal and cross-linguistic generalizations. 

Once case relations have been assigned in accordance with grammatical criteria, they do 
turn out to have some minimal fairly constant situational correlates. Thus PAT encodes the 
perceptually central participant (corresponding to some extent to the S+P disjunction in 
fijnctionally based approaches to syntactic typology), AGT encodes the external role which is 
perceived as impinging most directly on the PAT (often matching the functionally determined 'A'), 
and actr encodes the participant viewed as the instigator or controller of the action or state 
encoded by the predicate (matching the S+A disjunction fairly well). Case relations are mutually 
exclusive, but actr coexists with either AGT or PAT. Note that AGT and actr are not the same 
thing; each can be separately justified by the generalizations it underlies. 

6. Exemplification 

The categories I have just described are not handed down by a deity. As in any other 
empirioal science, they are hypotheses about the nature of human language, and are justified to the 
extent that they produce a more compact description of the nature of language (Occams's Razor) 
and make the correct predictions about possible grammatical configurations in human languages. 
Accordingly, in the remainder of this paper, I will discuss each of the basic components of the 
lexicase case marking system in turn, and show how its existence is justified by the language- 
specific and cross-linguistics generalizations it makes possible and the insights that result in an 
analysis of date from Formosan languages. I will occasionally compare the results with the analyses 
that would result from applying conventional functionally defined categories (S, A, and P; 
anglocentric 'subject', semantic transitivity, the Absolutive case form) to the same data. 

6.1 Primitives 

6.1.1 PAT 

The kinds of generalizations that can be stated in terms of PAT alone include verbal 
semantics, the scope of complement case relations and infinitival complements, noun incorporation, 
resultative constructions, verbal derivation, and patterns of discourse cohesion, especially 
coordination. Only the first three of these will be discussed in this paper. 

6.1.1.1 The semantics of difTerent verb classes 

English examples such as John loaded the hay on the truck and John loaded the truck with 
hay and their counterparts in other languages have long been a popular topic in the case grammar 
literature. The lexicase analysis of such examples claims that distinct though homophonous verbs 
are involved, loadi and loadi. Loadi is a transportation verb, like throw, in which a PAT is viewd 
as being moved to a LOG, while loadz is an affect verb, like cover, in which a PAT is interpreted as 
being affected by the action of the verb: 
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1) 

J threw, 1 

John UoadedJ the hay on the truck 
+trns PAT LOC 

+lctn 

2) 

J covered, 1 

John lloaded2 J the truck with hay 
+tms PAT MNS 

+lfct 

In 1), load\ or throw interprets its PAT as moving to LOC, and when the action is 
completed, all the hay is on the truck. In 2), on the other hand, loadj or cover interprets its PAT as 
being locationally affected, and when the action is completed, the truck is perceived as having been 
affected: the space it encodes is filled or covered. Thus the verbal semantics interpret the PAT in a 
particular way, and the PAT may be encoded differently in different perceptions of the same 
objective situation. 

The situation illustrated in 1) and 2) does not always involve homophony. Thus in German, 
verbs of type 2) may be prefixed by be-, and in Philippine-type languages, there are a number of 
affixation patterns for effecting this kind of reinterpretation. Linguists who are speakers of 
accusative languages have long regarded this as a very special and unique kind of phenomenon, 
even assigning it a whole new case-marking typology, as Tang Ting-chi does (Zeitoun 1993b:33), 
different from both accusative and ergative patterns. However, evidence has been piling up in 
recent years to the effect that Philippine languages are ergative.' Once that has been accepted and 
the analysis has been recast in a lexicase representation, Philippine-type Tocus' turns out to be 
nothing more than load-hay verbal derivation in an ergative mantle. The following examples from 
Zeitoun's MA thesis on Tsou will serve to illustrate the difference. I give the example first using 
Zeitoun's syntactic categories (but filling in information she left out, guessing as necessary, and 
leaving out non-relevant parts), and then in the lexicase ergative representation: 

3) [Zeitoun 1992b: 12, (28)a] 

mo mofi to emi to amo ?o ino 
FM AF Obi Obi Nm 

Theme Dative? Agent 
' Mother gave some wine to father. 

4) [Zeitoun 1992b: 12, (28)b] 

i -si fii to emi to ino ?o amo 
FM -CP DF? Obi Obi Nm 

Theme Agent Dative? 
' Father was given some wine by mother' 



' The first linguist to make this claim, though only for one class of Tagalog verbs, was Videa 
DeGuzman, in her lexicase Ph.D. dissertation (DeGuzman 1978), and was extended to the whole 
language by Stanley Starosta (Starosta 1986). The same discovery was made in a relational 
grammar framework by Donna Gerdts (Gerdts 1983) and in a GB mode by Gary Byma (Byma 
1986). 
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5) [Zeitoun 1992b: 12, (28)b] 

i -si faeni to amo to ino ?o emi 
FM -CP TF? Obi Obi Nm 

Dative? Agent Theme 
'The wine was given to father by mother' 

Zeitoun's discussion of this approach shows Tang's 'focusing' analysis to be a variant of the 
kind of analysis being done for Phillippine languages up until the mid-70's. One innovation she has 
made however, and one which does not seem to have any place in Tang's system, is to introduce a 
cover symbol NAF for all the verbal foci other than af 'actor focus' (Zeitoun 1992b:4). As she 
describes Tang's analysis, it makes no provision for this kind of binary distinc ion. In fact, this is a 
category she has lifted verbatim (without citation, in good GB style), fro t Shigeru Tsuchida's 
Ph.D. dissertation on Tsouic (Tsuchidal 976:43), and there is an extremely ^.-jori rationale for it; 
AF forms are grammatically intransitive, and NAF clauses are gramm itically transitive. 
Conventional Philippinist focus analyses used to refer to non-AF forms as 'passives', and Zeitoun 
uses this term in describing the AF-NAF dichotomy in her 1992a paper, an eariier version of her 
MA thesis (Zeitoun 1992a: 10), but it is not there in the final version (Zeitoun 1992b: 15). The 
earlier characterization in terms of active and passive suggests that the NAF forms might be 
intransitive, but in a lexicase analysis, it is the other way around, and exactly this distinction is the 
central pillar of the ergative analysis she rejects. Partial lexicase analyses for her examples, with 
different suggested glosses, illustrate this point: ^ 



3') 

mo giofi to emi lo amo 
-trns Gen Gen 

MNS LOC 
.+lctn J actr 
'Mother gave some wine to father.' 



?o 



mo 

Nom 

PAT 



tms = transitive 



4') 



SI 

Gen 

AGT 

actr 



+trns 
.+lfct 



to 



emi 
Gen 
MNS 



to 



mo 
Gen 
AGT 
actr 



?o 



amo 
Nom 
PAT 



Ifct = local effect 



'Mother gave father some wine.' 



5') 



Si 

Gen 

AGT 

actr 



faeni 
+tms 



.+lctn 



to 



amo 
Gen 
LOC 



?o 



emi 

Nom 

PAT 



Ictn = location 



to mo 
Gen 
AGT 
actr 

' Mother gave the wine to father.' 

There are several points to note here. The first is that emi 'wine' is translated as definite in 
5'), but as indefinite in 3') and 4'). Reason: the wine is the PAT, the central participant, in 5') but 
not in 3') and 4'), and PAT in Philippine and Formosan languages is almost always interpreted as 
definite, while the notional 'object' when encoded as MNS rather than PAT seems always to be 



^ For the remainder of the paper, all analyses assigned to examples from Formosan languages are 
mine unless otherwise indicates. 
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interpreted as indefinite. Second, as with the hay in 1) above, emi 'wine' in 5') is the thing that is 
interpreted as moving to the LOC, amo 'father' under the impetus of ino 'mother'. Similarly, as 
with the truck in 2), amo 'father* is interpreted in 4') as being locationally affected by the action. 

In 3'), finally, ino 'mother', the performer of the action (actr), is interpreted as the center of 
the action, with both father and the wine downgraded to the status of accessories to the action. In 
traditional Philippinist terms, it is the action that is important here, not the entities directly or 
indirectly affected by it. In Hopper and Thompson's terms, the example is semantically intransitive, 
and this is reflected in the lexicase analysis as a syntactically intransitive sentence. Unfortunately, I 
can't think of an English example of a three-argument intransitive sentence, but they exist in other 
languages, e.g. French: 

6) [Laurent Sagart, p.c] 

Elle peignait des fleurs sur les murs. 
Norn -tms Abl Lev 

PAT +lctn MNS LOC 

actr 

' She painted flowers on the walls.' 

6.1.1.2 Scope of complement and ajunct case dependents 

A non-subject complement case relation has a PAT in its scope. Thus the LOC 
complements in the following Rukai sentence (twalay tarumak, ?akila iiglig) refer specifically to 
the positions of the PAT (kayvay tudar)) rather than the domain within which the whole action 
took place: 

7) [Li 1973:123, (14)]' 

kiaani?alay sa sasivtra twalay tarumak ?akila [igtig kayvay tudag 
was-blown wind firom to mountain this tm-roof 

+tms Gen +sorc Lev +goal Lev Nom 

+lctn ACT LOC LOC PAT 

actr 

'This tin-roof was-blown firom Tarumak to the mountain.' 

In fact exactly the same pattern applies in intransitive clauses, though it is a bit harder to see 
because PAT and actr are marked on the same participant: 

8) [Li 1973:122, (8)] 

ania?alay twalay ubula ?akila tarumak kay a6a6am sore = source 

flew firom hill to this bird 

-tms +sorc Lev +goal Lev Nom 

+lctn LOC LOC PAT 

actr 

'This bird flew from the hill to Tarumak' 



M am tentatively using an ergative analysis here rather than following Li's original accusative 
analysis. This question is currently under active reconsideration. A desirable feature of the lexicase 
approach is that the question of the scope of complement case relations is independent of this 
determination, and generalizes across both linguistic classifications. 
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The same patem is found with other complement case relations, which again bear directly 
on the PAT, while other verbal dependents refer to the action or state encoded in the predicate. 
Thus in the following Atayal example, biru? "the materials' relate specifically to Tali--he is the one 
who ends up with them-while the adverb suhan describes the action of sending as a whole: 

9) [Huang 1993:70, (45)a] 




stu-n 
send 

+tms -maku? 
+lctn -IS.G 

Gen 

ACT 

actr 

'I will send Tali the materials tomorrow' 



)iru? 

300k 

Gen 
MNS 



suhan tali 
tomorrow Tali 
Adv Nom 
PAT 



In a Chomskyan grammar, this might be accounted for (if at all) in terms of a hierarchical structure 
in which complements are closer to the verb than adjuncts, but that won't in general work, since an 
adjunct may be closer to the predicate in linear order than a complement, as in the example just 
given. 

We can also regard the PAT as the 'scope' of an infinitival complement constmction, in that 
the missing subject of the infinitival clause (Nom-actr in a lexicase analysis) is interpreted as 
identical to the PAT of its matrix verb, regardless of whether the matrix verb is transitive or 
intransitive. Two examples fi-om Atayal will illustrate this point: 

10) [Huang 1993:89, (3)b]' 

A 



m-usa? -ku? 
go 1 
-tms Nom 
+fint PAT 
actr 



m-aziy 
buy 
-tms 
-fint 



Nom 
PAT 
actr 



qaya 
thing 
Gen 
COR 



mijka? 
Taipei 
Lev 
LOC 



" I am going to Taipei to buy things' 
11) [Huang 1993:89, (14)a] 



qrq-an -ku? -nya? m-ihiy A isu? 
stop I by_him beat 2S.L] 
+tms Nom Gen -tms Nom Gen 
+xtns PAT AGT -fint PAT COR 

actr actr 
'He stopped me fi-om beating you" 

It might appear that the appropriate generalization is that the missing lower-clause subject is 
controlled by the upper-clause Nominative constituent, but a comparison with an accusative 



"In this paper, I will indicate the presence of missing constituents by the use of delta's (A). I do 
this as a convenience to readers who have no particular interest in formal syntax. In a fully 
specified lexicase representation, there are no empty categories whatsoever: no trace, no PRO, and 
no unfilled nodes. 
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language like English, where the upper clause PAT is in the accusative, shows the the appropriate 
generalization is upper PAT - lower actr: * 



12) 

He stopped me from A beating you 

Nom +tms Acc Nom +tms Acc 

AGT PAT AGT -fmt PAT 

actr actr 

6.1.2 Transitivity 

It was the great service of Paul Hopper and Sandra Thompson (Hopper and Thompson 
1980) that they sensitized us to the distinction between semantic and syntactic transitivity, and 
made it possible to use each to support an analysis of the other. In my own experience, it was this 
paper which helped me to recognize the difference between true transitive constructions and 
pseudotransitives in Polynesian, Philippine, and Formosan languages, and the paper seems to have 
done the same service for Lillian Huang in her paper on Atayal transitivity (Huang MS (1993)). It 
is the great disservice of flinctional typologists to continue to use the situational categories A and P 
as if they were rigorous syntactic constructs, thereby obscuring this distinction. 

The typical situation in Formosan and Phipippine languages with respect to transitivity is 
the following: there are two classes of two-argument sentences, both involving 'A' and 'O' in some 
pretheoretical sense, but differing from each other systematically in morphology, syntax, and 
semantics, exactly as Hopper and Thompson have led us to expect. I will refer to them here as 
'transitive' and 'pseudotransitive'.* Some of their differences can be summarized as follows: 

6.1.2.1 Morphology: 

Intransitive verbs in Formosan and Philippine languages, including both single-argument 
intransitives and pseudotransitives, commonly take a prefix of the form /w(V)- or an infix of the 
form -(V)/n-, while transitive verbs are commonly suffixed by -{V)n, -/, or -a, or, less commonly, 
prefixed by s{V)-. Either of the terms can be absent, so that in Tsou there are intransitive /w-forms 
contrasting with unmarked transitive forms (cf Zeitoun 1992b: 15- 17). As mentioned earler, 
Zeitoun labels the two sets 'AF' and 'NAF', and finally, on p. 49, refers to Hopper and Thompson's 
article and seems almost to equate AF with intransitivity and NAF with transitivity. As far as I 
could tell, though, she views this purely as a semantx matter and does not draw the syntactic 
consequences nor notice the inconsistency between an analysis stated in terms of a binary 
transitivity distinction and Tang's multipolar 'focusing' analysis. 



M am glossing over several complications here, both at the metatheoretical level and in the analysis 
of Atayal complex constructions; see Huang 1993, Chapter 4. 

*I have sometimes referred to pseudotransitives in earlier work as 'antipassives', but as Lisa 
Zeitoun has pointed out (Zeitoun 1992b:32), the term 'antipassive' is normally used to mark a 
morphologically marked member of a derivationally related transtive:intransitive pair of two- 
argument verbs. In Polynesian, though, and frequently in Tsou, it is the intransitive member of the 
pair which is unmarked, and the transitive member which bears the affix. (Her comment was stated 
in terms of transformations and inflectit n rather than in terms of lexical derivation, but the point 
about markedness is still a valid one.) 
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6.1.2.2 Syntax 



If a language has clitic pronouns, a genitive set will occur with transitive verbs and a 
nominative set with intransitive verbs. The third person Nominative is commonly zero. There are 
two variants of this pattern: (1) Tsou and Yami allow only one clitic per clause, so genitive 
pronouns occur with transitives and nominatives occur with intransitives. Unfortunately, the two 
sets are almost identical in form in Tsou, and it is only the third person forms (si or he for Genitive, 
zero for nominative) which make a clear distinction. (2) Atayal has both sets, and they may 
cooccur in a transitive clause. Surprisingly, Huang does not seem to have noticed this pattern in 
her paper on Atayal ergativity. 

6.1.2.3 Semantics 

The Formosan transitive-pseudotransitive contrast is a beautiful example of what Hopper 
and Thompson were talking about. When there is a contrast between two related verbs along this 
axis, the syntactically transitive member of the pair is also semantically transitive, and vice vti ^a. 
Huang (Huang MS (1993)) has given a very race and complete set of examples of this contrast for 
Atayal, but in case the reader is tired of this harsh northern languages, here is a pair from the gentle 
southern climes of Orchid Island, ancient home of the Yami people and more recently of a nuclear 
waste disposal site: 

14) Pseudotransitive [Ho 1990:3. l-17b] 

ya A kuman si mapapu su suli 
eat Mapapu taro 

Nom -tms Nom Gen 

PAT PAT MNS 

actr actr 
'Mapapu is eating taroes.' 

Note the absence of a third person nominative clitic pronoun. 

15) Transitive [Ho 1990:3. l-17a] 

ya na ni-kan ni mapapu u suli 

by her eaten Mapapu taro 

Gen +tms Gen Nom 

AGT AGT PAT 

actr actr 
"Mapapu has eaten up the taroes.' 

This pair illustrates a typical manifestation of semantic transitivity: 14) represents an in- 
progress action with a partial affect on the "0", while 1515) illustrates a completed action with a 
total affect. Morphologically, the verb in 14) has an -um- infix, while the verb in 15) doesn't (ni- 
marks aspect, not focus), and syntactically, 14) has a zero nominative clitic pronoun, while 15) has 
the genitive clitic na. For more examples, see Ho ',990 and/or Huang MS (1993). 

The following Tsou examples illustrate the same contrast: 
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16) [Zeitoun MS (1993), (9)a] 



mo A mimo ta chimi ?eino 

-tms Nom -tms Gen Nom 

PAT MNS PAT 

actr actr 
'Mother is drinking water .' 

17) [Zeitoun MS (1993), (9)b] 

i si ima ta ino si chimi 
-tms Nom -tms Gen Nom 

PAT MNS PAT 

actr actr 
'He has dmnk water.' 

18) [Zeitoun 1992b:31, (82); 'A(gent) F(ocus) Construction'] 

mo A eobako ta oko ?e ino 

-tms Nom -tms Gen Nom 

PAT MNS PAT 

actr actr 

'Mother is beating the child' 

19) [Zeitoun 1992b:31, (83); 'N(on) A(gent) F(ocus) Construction'] 

i ta eobaka ta ino ?e oko 
+tms Gen +tms • Gen Nom 

ACT AGT PAT 

actr actr 
'The child has been beaten by mother.' 

20) [Zeitoun 1992b:31, (84)a] 

mo A mogsi ?eoko 
-tms Nom -tms Nom 

PAT PAT 

actr actr 
'The child is crying' 

The non-auxiliary verbs in 16) and 17) and in 18) and 19) differ in the presence or absence 
of an -a suffix, reconstructed as a transitive suffix all the way back to PAN by Starosta, Pawley, 
and Reid (Starosta, Pawley, and Reid 1982), and in 16) and 20), the non-auxiliary also begins with 
the expected intransitive prefix m-. Morphologically, the m- auxiliary verbs in 1 6), 1 8), and 20) 
too are intransitive, and must be followed by intransitive verbs and zero third person pronouns, 
while the transitive non-m auxiliary verbs in 17) and 19) must be followed by a genitive pronoun 
and the transitive-suffixed ima and eobaka. Based on my own research on Tsou, I find water in 
1 7) and the child in 1 8) quite suspicious; I would have expected the water and a child. However, 
the aspectual distinctions are as they should be: progressive for the pseudotransitives 16) and 18) 
and simple intransitive 20), and completed for the grammatically transitive examples 17) and 19). 

So how does Zeitoun, working within Tang's 'focusing language' analysis, view sets of 
examples such as these? First of all, the auxiliary verbs she refers to as 'focus markers', a purely 
arbitrary category whose properties don't follow from any general principles and thus must be 
stipulatec'. 
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'Aside from these two types of transitive sentences[18) and 19)], we also find intransitive 
sentences...' (Zeitoun 1992b:31) 

Thus she regards both 18) and 19) as transitive. This shows that >he has missed the 
distinction between grammatically transitive clauses and pseudotransitive clauses, which is the key 
to the whole case-marking system. On page 11, she refers to this distinction in clause patterns as 
an 'orientation (active or passive)' (Zeitoun 1992b: 11-12), something that Tung T'ung-ho warned 
rightly against in 1964. 

Unfortunately, this whole system of grammatical transitivity in Formosan languages is easy 
to miss if we follow the anglocentric functional guidelines of conventional syntactic typology and 
look at clauses in terms of S, A, and P. Comrie states the procedure as follows: 

'A refers to that argument of a transitive verb which would be its subject in a non-ergative 
language like English....' (Comrie 1978:330) 

'In the transitive construction, we start from a set of canonical (prototypical) transitive 
constructions, referring to actions where an agent acts upon a patient, and use A for the 
agent in such a construction and P for the patient.' (Comrie 1984:92) 

Unfortunately, prototypical' here refers to situational rather than grammatical prototypes. 
Semantics in, semantics out, syntactic insight nil. 

'The A/P terminology can be extended, however, to other transitive constructions with the 
same syntactic behavior, but where the participants are not, strictly, semantic agent and 
patient.' (ibid.) 

All very fine, but by then the damage has already been done. 
6.1.3 Norn 

6. 1 .3. 1 Nominative vs. Absoiutive 

As noted earlier, NPs labeled by the terms Nominative and Absoiutive share a large number 
of grammatical properties across languages, including almost everything except the case relation 
they mark in a transitive clause, and an adequate universal linguistic theory will need a means of 
accounting for this. In lexicase it is done by labeling all such NPs Nominative'. However, in a 
conventional ergative analysis which uses the term 'Absoiutive' for the case form which marks 
S+P, the similarity between Nominative and Absoiutive is an unexplained coincidence: 

'In nominative-accusative morphology, it is typical (though not quite universal) for the 
nominative to be less complex morphologically... than the accusative and oblique..., whereas 
in ergative-absolute morphology it is typical for the absolute to be less compelx 
morphologically... and the ergative and oblique more complex....' (Comrie 1978:368-369) 

In the lexicase analysis, on the other hand, which does not set up a separate ' Absoiutive' case for 
ergative languages, the similarity is explained because there is no difference between the two 
categories. 

6.1.3.2 Minimal morphoiogicai marking 

A number of Formosan languages (including Kanakanabu, Saaroa, Tsou, Atayal, and Rukai, and 
also Yami, geographically but not genetically Fomosan) have a gap in the paradigm of bound 
pronouns. The missing form is by the lexicase analysis the third person Nominative, which is in 
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accord with the universal tendency for Nominative to be the least marked case form In a 
conventional ergative analysis, of course, this form would be 'Absolutive', and would require a 
separate generalization. This phenomenon is noted by Zeitoun (Zeitoun 1992b: 62), but only as an 
arbitrary restriction: 

'We may therefore observe a gap in the pronominal paradigm: in AF constructions [in 
1992a:43 it was 'in transitive antipassive sentences'], the enunciator can't refer to an 
invisible actor by means of a specific pronoun.' 

6.1.3.3 Relative clauses 

As mentioned earlier, regarding the 'Absolutive' case form as Nominative eliminates one 
systematic exception (Croft 1990:110) to the Keenan and Comrie 'NP accessibility hierarchy' for 
relative clauses. If Formosan and Philippine languages are ergative, and if a 'subject' is just a case 
relation-bearing Nominative constituent, then these languages ire in general consistent with the 
basic part of the hierarchy: subject<non-subject. The following examples given to me by Lillian 
Huang (Huang, p.c.) illustrate this point: 

21) [Huang, p.c, 1] 



sicyon 
like 

+tms mu 

I . 

^ qasa 
that 




squliq 
person 



minwah 
came 
-tms 



A 

Nom 
PAT 
actr 
"Abs" 



mita 
see 



kuziri 
me 



hira? 
yesterday 



'I like that person who came to see me yesterday.' 

squliq is the regent of an intransitive relative clause. The missing argument corresponding 
to squliq would be a Nominative, so this example is consistent with the NP accessibility hierarchy, 
whereas if the missing argument were labeled 'Absolutive', it would be an exception. 

22) [Huang 1993:101,(32)] 



minwah 
came 



mita? 
see 




isu? 
you 



lat 



'The person who likes you came to see you yesterday.' 
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22) is an example ot a kind of reversal of 21), with minwah 'came' on top and sicyon Mike' in the 
relative clause. Here sicyon [+tms] has been repalced by smoya [-trns] in order to allow the 
relativized NP to occupy the Nominative slot: 

23) [Huang, p.c, 9] 




sicyon 
ike 



maku? balay nbuw 
I very drink 




bina2dy 
bouglit 
+tms 



su? 
you 
Gen 
AGT 
actr 



hira? 
Nom yesterday 
PAT 
"Abs" 



qasa 
that 



I like to drink the tea you bought yesterday.' 

In this example, the embedded sentence is a transitive clause. The missing argument again 
is the Nominative by the ergative analysis, and is thus consistent with the NP accessibility 
hierarchy. It would also be compatible, at least in this respect, \.'ith a passive analysis such as 
Zeitoun implies, but again not with an ergative/absolutive representation. 

6.1.3.4 Word order 

Word order is another area in which the Nominative case form is of value If we use an 
anglocentric or 'semantic' characterization of 'subjects' and other arguments, as Greenberg and his 
successors have done and as for example Huang does for Formosan languages (Huang 1993:11- 
12), important generalizations would be lost in a language like Tsou (Zeitoun MS (1993):6, fh. 
10). That is, if we define subject to mean 'grammatical subject', then we can identify Tsou as a 
subject final language (Zeitoun 1992b: 3-4). As Zeitoun puts it, 

'Following Greenberg's (1963) language typology, Tsou can, therefore, be defined as a V- 
0-S language.' 

Ah, would that that were so! Even if we amend Zeitoun's statement to ' V-X-S language' to allow 
for non-object arguments, she is still giving Greenberg too much credit to say that he was doing the 
same thing. He did not do, and could not possibly have done, a full grammatical analysis of every 
language he included in his surveys. Instead, ' subject' for him was not the grammatical subject, as 
Zeitoun is quite properly using the term, but just whatever translated the English subject, that is, 
S+A. 

6.1.4 actr 

The category actr by itself is a very useful one in that it makes possible the statement of 
universal generalizations about two constructions in a way which applies equally well to ergative 
and accusative languages: 
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Imperative constructions: 



The participant which is ordered to perform an action in an imperative construction is an 
Actor, and the actant which may be omissible in imperatives... is the Actor. (Starosta 
1988:151). 

I provided examples from Tagalog and English here to show that this generalization applies 
independently of the ergative-accusative distinction. This analysis avoids the awkwardness of 
Dixon's claim 'that all imperative constructions follow an accusative pattern' (Croft 1990:153), 
even when they have no accusative case form in the syntax at all, and the contortions Comrie goes 
through (Comrie 1981 :1 1 1) in coming to grips with the same problem. 

Reflexive constructions: 

'The element which usually controls reflexivization is the Actor rather than the subject.... In 
English, and in accusative languages in general. Actor and subject (Nominaitve) coincide. 
However, we can see that the crucial category is Actor rather than Nominative by 
comparing... examples from an ergative language, Tagalog.... (Starosta 1988:152-153) 

In most or all of the cases when Comrie speaks of of "accusative syntax', then, e.g. 

'Let us therefore turn to some constructions where there is a pragmatic expectation of 
cross-linguistic bias in favor of one particular kind of syntax, in fact in favor of accusative 
syntax.' (Comrie 1984:93-94) 

I think it is fairly safe to conclude that there is a generalization waiting to be stated in terms of the 
category actr. The same is true of Dixon's " S/A pivot': 

(i) S/A pivot: the coreferential NP must be in derived S or A function in one (or both) 
clauses.... 

'Pivot' is a language-particular category that is entirely syntactic in nature and application.' 
(Dixon 1979:121-122). 

The generalizations that Dixon would make in terms of his language-specific ad hoc pivot can be 
recast directly in terms of the universal category actr. 

7. Conclusion 

Since the end of my allotted twenty pages is at hand, I will only list some of the remaining 
generalizations that are made possible within the Formosan language family and beyond by using 
combinations of one or more of the lexicase primitives. 

Nom and PAT: ergative languages are those in which the PAT is always marked by the 
Nominative case form. Note in contrast that a description of ergative syntax stated in terms of 
participant roles instead of syntactically based primitives (Huang MS (1993)) cannot describe the 
case-marking pattern without a disjunction, a many-to-many relation between participant roles and 
case forms. A similar comment applies to Zeitoun's earlier analysis: 

'...the nominative case co-occurs with the in-focus NP of a given sentence, i.e. the 
S(ubject) of an intransitive clause, the A(gent) of an antipassive sentence and the P(atient) 
of a transitive passive sentence.' (Zeitoun 1993a: 52) 

AGT and transitivity: The presence of an AGT implies that the verb is transitive and vice versa. 
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PAT and actr: the missing actr in an infinitival complement is corsferential with the PAT of the 
regent verb. 

actr, Nom, and transitivity: In Formosan and Western Austronesian languages with "pronoun 
attraction'/" clitic climbing', the coreference relation between the clitic pronoun attached to the 
auxiliary verb and the overt or covert noun it coreferences in the dependent clause can be stated 
neatly: actr-to-actr, Nom-to-Nom (Starosta 1986). The reader is invited to try this out with the 
examples given elsewhere in this paper. This requirement turns out to explain the agreement' in 
"active/passive' in Tsou which Zeitoun notes but doesn't try to account for (Zeitoun 1992a: 10, 
1992b: 11-12, 19). In fact, by abandoning the "auxiliaries as main verbs' analysis in the later version, 
she gets even farther away from a real solution. 
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A 'MiniBalist' Approach to a Contrastive 
Aalysis of English, Chinese and Japanese 

Ting-chi Tang 
National Tsing-Hwa University 



1. Introduction . 

It has often been coo^lained by language teachers and conputational 
linguists that Govemaent -Binding Theory (or its latest developoents as the 
Principles-and-P&raneters Approach and the Minimalist Program) is too abstract 
in conten* and too esoteric in form to be of much use for language teaching or 
machine translation. In this pater, we will propose a 'minimalist' approach to 
contrastive analysis, in which the theta-grid constitutes an essential part of 
lexical entries, and the computational sv tern which regulates the projection 
of the content of theta-grids in teti« of the X-bar theory and other 
principles of universal graiM»r. In this approach, sentences are simply 
projections of the obligatory arguments registered in the theta-grids of 
predicates, coabined with svdwtitution or adjunction of various optional 
arguments. The role of Affect o, or the rule system in general, is. so 
di-asticaliy reduced as to become almost noti -existent, and D-structure and 
S-structure, t<hich seem to play no significant role in language t«»ching or 
machine translation, may be altogether eliminated. The paper consists of six 
sections. After a brief introductory section, section 2 and 3 pre8«it our 
theoretical assuaptions and survey the semantico-syntactic proparties of 
predicates as related to the theta-roles of their associated arguments. 
Section 4 then discusses how these semantico-syntactic pn^perties can be 
incorporated in a very simple format of theta-grids, while section 5 
investigates the conditions and constraints on the projection of the syntactic 
information presented in theta-grids. Finally, the concluding section 
suamarizes how contrastive analysis of three genetically unrelated and 
typologically distinct languages, English, Chinese and Japanese, can be 
conducted by comparing the contents of the theta-grids between the 
corresponding predicates in these languages and investigating the manner in 
which they are projected into surface sentences. The relevance and value of 
the 'minimalist' api»roach to language typology and machine translation is also 
briefly touched upon in the final section. 

2. Theoretical Assumptions 

Before going into a detailed discussion of our grammatical theory and 
analysis, we will briefly present our theoretical assumptions concerning how a 
natural language can be analyzed and generated in its simplest terms, how 
different languages can be compared with, or transferred from, each other in a 
most straii^tforward manner, and in what sense our approach can be called 
'minimalist'. 

(1) Lar«uage, or lar«uage faculty underlying it, can be analyzed as consisting 
mainly of two components: the lexicon and the computational system. 

(2) The lexicon can be considered as the sum total of lexical items, which 
include words, morphemes and idioms occurring in a particular language. A 
lexical item, in turn, can be analyzed as a complex of phonetic, syntactic and 
semantic features specified in the lexical entries of these lexical items. One 
of the main sources of the idiosyncracies of a particular language or 
particular gramMU" lies in the lexicon. 

(3) The computational system, on the other hand, consists of a very limited 
nuiriser of principles, along with a few parameters, the values of which (e.g. 
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the plus or minus value, or a choice aaong several alternative items) are left 
for individual languages to fix. Since these principles are highly interactive 
with each other, different selections of the paranetric values also lead to 
considerable differences between languages. 

(4) The cooputational systea so conceived can be considered as a theoretical 
model for univ«'sal granmar (UG), whose principles and parameters characterize 
and constrain the core gramaar (CG) of all natural languages. In addition to 
the core graaicir, which constitutes the main body of a particular gramiar 
(PG), individual languages may also contain a limited aaount of peripheral 
gramaar to handle their marked phenooena or constructions, resulting in 
further discrepancies between languages. 

(5) SentecKes consist of predicates, including verbs, adjectives and nouns, 
and their accompanying arguments, oblij^tory or optional. That is, sentences 
can be simply analyzed as rejections of the syntactic properties of 
predicates. Furthermore, the projections must be constrained or licensed by 
the principles and parameters of universal graa'^ar. Thus, our main concerns 
will be: (a) Exactly what syntactic properties of predicates are relevant to 
the projection into sentences? (b) How can these syntactic properties be most 
simply and generally roistered in the lexical entries of predicates? And (c) 
in what manner and under what constraints are these syntactic properties of 
predicates projected into sentences? 

(6) Our approach purports to be 'minimalist' in that not only do we admit of 
only the lexicon and the computational systea, thereby drastically reducing 
the role of the rule system in our graaoEir, but we also derive the surface 
structure of a sentence without recourse to its deep structure. Moreover, all 
the syntactic information necessary for the projection into sentences is 
registered in the simplest way in the form of theta-grids for predicates, 
which will straightforwardly map onto sentences. This approach will not only 
simplify contrastive analysis between languages by pointing out the 
similarities and dissimilarities between the contents of the theta-grids of 
the corresponding predicates and the ways they project into sentences, but 
will also facilitate machine translation by rendering parsing rules and 
transfer rules virtually unnecessary. 

3. Syntactic Properties of Predicates 

Syntactic properties of predicate verbs, adjectives and nouns which are 
crucially relevant to projection into sentences include the following: 

(1) Argument propa-ties of predicates; that is, how many obligatory arguments 
do these predicates require to form a cooplete sentence? Are they 'one- tern' 
predicates (e.g. intransitive verbs), 'two-term' predicates (e.g. transitive 
verbs) or 'three-term' predicates (e.g. ditransitive verbs and complex 
transitive verbs)? 

(2) Thematic properties of predicates: that is, what kind of semantic roles do 
these arguments play? Do they play the thematic role of Agent, Experiencer, 
Theme, Source, Goal, Benefactive, Instrument, Location or Time? 

(3) Categorial features of arguments: that is, what syntactic category do 
these arguments belong to? Are they a noun phrase (NP), an adjective phrase 
(AP), an adverb phrase (AdP), a prepositional phrase (PP) or a clause (IP or 
CP)? If it is a clause, then what semantic type (e.g. declarative, 
interrogative, exclamatory) and syntactic type (e.g. finite, infinitival, 
participial, gemndive) does it belong to? 

(4) Syntactic functions of arguments: that is, what syntactic functions do 
these arguments perform? Do they serve as subject, object, complement or 
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adjunct? And what position do tliey fill in a sentence? 

Therefore, we are concerned with two probleos, both in theory and 
execution: how can we register these syntactic properties in the lexical 
entries of predicates in as simple and explicit a manner as possible: and how 
can we project these syntactic properties of predicates into sentences in as 
economical and straightforward a manner as possible. Our solution to the first 
problem will be the compilation of theta-grids for predicates, which enploy 
theta-roles as basic units of lexical inforaaiton. 

Our proposed theta-grids consist of theta-roles which indicate semantic 
roles played by the arguments associated with predicates. It is still a moot 
question how many theta-roles should be recognized in universal grammar and 
how each theta-role should be defined and distinguished from others. The 
selection and determination of theta-roles must satisfy the criteria of 
universality, optimal ity and objectivity. We will not, however, go into a 
detailed technical discussion of how to set up a universal set of theta-roles, 
but rather, will heuristical ly present theta-roles which we think are useful 
in our discussion of contrastive analysis and machine translation, along with 
their semantic import, canonical structure realzation, collocation with 
adpositions (including prepositions and postpositions) and distribution in a 
sentence. 

(i) Agent (Ag): the volunteu^ and self-control lable instigator of the action 
identified by an actional verb, typically an animate or human NP: always 
occurring as subject of an active sentence or introduced by agentive 
adpositions 'by: X. I^* in a passive sentence, for example: 

(1) ci. [Ag John ] smashed the vase with a haaoer. 

b. [a« 'i^m 1 mmmmrmL 

c. [Ag :fcfiPA« ] ^«lTrEft*«loTt*-9fc, 

(2) a. The vase weis smashed [Ag by John ] with a hammer. 

b. ms. [Ag m^m ] mmmmwj, 

c. [Ag ] ^UTmhtixit-^tz, 

(ii) Experiencer (Ex): the non-voluntary or non-self -control lable participant 
of perception or cognition identified by a stative verb, or one that is 
affected by a genuine psychological event or mental state, typical} y an 
animate or human NP, and capable of occurring as subject of an active sentence 
(as in (3) and (5)), the object of agentive adpositions 'by: ft, M> IZ' 
in a passive sentence (as in (4)) or the object of psychological causative 
verbs (as in (6) and (7)): 

(3) a. [ex John ] ({unitentionally/ ♦intentionally}) heard Mary's words. 

b. [bx 'bm ] ({Mifc*/ *^im}) «T'h*«ie, 



1. Unlike English ('look at' v. 'see', 'listen to' v. 'hear') and Chinese C^' v. 'S 
iMs/M)', 'K' V. 'ttl^M) '). which have a pair of actional versus stative verbs, 
Japanese C^S' v. '^^S', '|VI< ' v. 'PfJCljt*') has a pair of transitive-actional 
versus intransitive-stative verbs. Thus, while English and Chinese aay have a passive 
sentence with a stative verb of perception. Japanese can have a passive sentence only 
with a transitive-actional verb, but not with an intransitlve-statlve verb. Moreover, 
the Experiencer in (3c) light be better analyzed as Goal. 
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(4) a. Mary's words were ( lunintentionally/ 'intentionally 1 ) heard 

[j, by John]. 

c. ^BmiiriJ''/j< ( mwy u-^^t) ) [../go Mtci niicru^cn 12] 

(5) a. John] fears his father. 

b. M]] ti'Hrylliio 

(6) a. John's remarks greatly surprised [r, everyone]. 

b. 'bmmm [e. ;^^»o 

(7) a. John struck [r. Mary] as pompous. 

b. /J^U3^ [../GO /j^l^l] Wt^i^J^MAfiAo [3] 

c. [../GO ?t?(c] {|j[ti4-g]^<&^-;l/co [3] 

(iii) Theme (Th): the entity that exists, moves or changes; when used with a 
locational verb it denotes an entity that exists (as in (8) through (10)), when 
used with a transitional verb it denotes an entity that moves (as in (11) 
through (13)), and when used with a transitional verb it denotes an entity that 
undergoes a change (as in (lU) and (15)); typically an NP (animate or inanimate, 
concrete or abstract) and may occur as the subjept of a sentence (as in (8), 
(11), (14)), the object of a transitive verb (as in (1a,b), (12), (13), (15)) 
or an adposition (as in (10b, c), (12b, c), (15b, c)). In English, Themes 
following adjectives and nouns are often introduced by the preposition 'of (as 
in (16a) and (17a)) while Themes in Chinese may be either preceded by the 
preposition 'JC" or '?>}' in an active sentence (as in (I8b) and (19b)) 
, depending on whether they occur with predicate verbs or adjectives. As for 
Japanese, predicate verbs and adjectives have nothing to do with the Case- 
assignment, since Cases are all assigned by postpositions. Thus, if Themes 
occur with intransitive verbs, they are invariably assigned the subject-marker 
' (as in (8c), (11c), (I4c), (I6c)). If Themes, on the other hand, occur 
with transitive verbs, they are more often than not assigned -the object-marker 
';|r' (as in (10c). (12c), (13c). (15c). (18c)). Furthermore, when Themes serve 
as the topics of sentences, they are assigned the topic-marker ' (as in 
(lie), (14c)). 

(8) a. [rh The dot] is inside the circle. 

b. [t. Sfe] MSfflo 

c. [rh PJ(7>f{C*^o 

(9) a. The circle contains [th the dot]. 

b. BlfflMisW [rh HAl o 

c. Fq(7)4^(C [th ,W ^-So w 

(10) a. John put [th the book] on the bookshelf. 

b. [tb }G«] iA:&nm±.o 

c. Mm [t. *ffi!io±(cal^fco 

(11) a. [ti, The car] rolled down the slope. 

b. [th Hm mihmmr^o 

c. [th * {^</ii) ] iSJcm-'T$Ef^L^Co 

(12) a. John gave [th the book] to Mary. 

b. fm [th 

c. i^mi [th ^<d^^] :?KT-Jc*Jt7co 

(13) a. Mary got [th the book] from John. 

b. /J^w/J^^i^ mm) mmj Lth m-^m) o 

c. m-f-i [th ^(Dy^^] MU'^hh-yfz 
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2. The permutation between 'Afil^iC and '^^T-<^t^^^^' i" (^c) results not from 
Passlvlsatlon but from Scrambling. 

3. The Experlencer occurring in Chinese (7b) and Japanese (7c) might be better 
interpreted as Goal. 

4. Japanese does not seem to have a verb corresponding to the English 'contain' 
and the Chinese '^H' tlie sense used here, and (9c) is simply the result 
pf scrambling (8c). The nearest possible translation in Japanese may be 'fT){i 
^■(^rl'lC [th ^f/vTl^-S)' . 
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(U) a. [jh The prince] turned into a frog. 

b. [th D WrmiU, 

c. [,,. r-m Hi/m 1 *[:{C5(^^v.rL* -/.:„ 

(15) a. The magic wand turned [ji, the prince] into a frog. 

(16) a. John { likes/ is fond ofl [th music]. 

b. ^mm^ [rn ^m] . 

c. Mm [th t^^<] ^iT-^A^o 

(17) a. (That the enemy might destroy [j,, the town] /The enemy's possible 

destruction [th o£ the town] ) never came into the general's mind. 

b. mwmmmm- m [th mm] o 

(18) a. Mary has { cleaned [th the room] / [tu the room] cleaned ) . 

b. mm J mnn] / Cth rM^n] rrmii ^ 

c. m-ii [th ^ii?M^i mmuz. 

(19) a. He is very much concerned [th about you]. 

b. m [mm [th {^fi / [th Mwrmm'C^\ o 

c. mt { [th S<7DCi^] 'C^SiLT/ [th/Go ^(TDCifC] IHliC^^^tL-Tl 

(Iv) Goal (Go): the receiver as destination, including the later state or end 
result of some action or change, animate (recipient) or inanimate (destination 
or temporal end-point); often occurring as complements (as in (20)) or as 
adverbials (as in (2h) and (25)), but sometimes as subjects (as in (22)) or as 
objects (as in (21 a, b) and (23)). Recipient complements are often introduced by 
the adpositions 'to; |&; tC' (as in (20)), destination adverbials by 'to, into, 
onto; 5iJ; (C, (as in (2h)) , and temporal end-point adverbials by 'to, till, 
until, through; f!]; ^X"(as in (25)): 

(20) a. John gave his old car [c„ to Mary]. 

b. 4^mmmmv\'rm ico Mi^m] . 

c. :;*;;i:i|Uif'ri->'1'^ [co -/liTfc] ,^-.^^„ 

(21) a. John gave [<;„ Mary] his old car. 

b. fb^m [g„ (i^) /Jn^] iMmm-. 

c. Mm [go ?E-P1c] i!ri,^|i^,^o^c., 

(22) a. [go John] finally received. the letter. 

b. [go A^m] mmnm-m$o 

C. [go Mm] '^^,^{C^<7^#^^'g^tK-r.fCo 

(23) a. The letter finally reached [go John]. 

b. m-mmmmr ico 4mm] o 

c. ^c7)#la{iol^{c [go ;^6|5(D^7c;{c] Jg^,^/■Co 

(24) a. They traveled from Boston [go to New York]. 

b. mmi&±mmn [go m^j] o 

(25) a. He will be staying here from June {to/ till/ until/ through) 

[go August]. 

b. m'i}ty^<n [go mAm mr-fumm. 

c. ^A:i^iA^;6^«i (go AJ^tX) ^Cizi^^tEt^o 

(v) Source (So): the origin or starting point, including an earlier location, 
state or time point, often used in conjunction with Goal, which indicates 
the later location, state or time point; mostly occurring as adverbials and 
introduced by such adpositions as 'from, since (time point); 

li/ 17) ^il^ (time point); j^^^j' (as in (26), (27) and (28)). When used with' 
verbs of trading and transition, however, Source can be a human NP (as in 
(29)) and, moreover, may occur as subject (as in (30) and (32)) or as object 
(as in (31)): 

(26) a. They moved [,„ from the city] into the country. 

b. mif^i.o i^t*fifnR?iijim. 

(27) a. The meeting lasted [,„ from nine] to eleven. 
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b. m t^o i;L!P,'i] imm i- n., 

(28) a. I have been hereTTI si^nce this morning]. 

c . mi [.->.." ^'I'JI/JiO 1 ■ ■ - ch C C {c I > /::„ 

(29) a. John bought the house [.,„ from Mary] . 

b. /jsiiji [,„ m^r^ (}im[) i wrm-mnj:^ 

(30) a. Mary] sold the house to John. 

b. '\^m m -murffm'bm., 

(31) a. The magic wand turned tso/n. the prince] into a frog. 

b. mm tso/TH £-T] miULniio 

(32) a. [so/Th The prince] turned into a frog. 

b. t.„/TH £7-1 m^jm^Eo 

(vi) Benefactive (Be): the person affected favorably (beneficiary) or 
unfavorably (malef iciary) because of an action performed by the agent or an 
event that took place, typically an animate or human NP; often occurring as 
complements or as adverbials and introduced by such adpositions as 'for 
(beneficiary), on (malef Iciary) . on behalf of; |^ (in preverbal adverbials or 
postverbal complements). W (only in preverbal adverbials). 3lW 
(malef iciary); (C*. (Dtcl^K' ■ When used with ditransitive verbs, however. 
Benefactive may occur in the object position (as in (33a')). 

(33) a. John bought a mink coat [r. for Mary], 
a'. John bought [b. Mary] a mink coat. 

b. /jNnjIRT-MfS;^* ii^'l^W] o 
b'. /jNii;] Cb. ^E&/j^^i?] 1[T--'Mj^:;*^*o 

c. Mma •y'^<^=i~\^^ in-- ^ETtc] R-T^>{f/-c„ 

(3'») a. John cleaned the room [o. for Mary]. 

b. /jMin in. m/m 'hM^i trnTMnijo 

(35) a. John bought the book for Mary] onbehalfof his brotlier J . 

(36) a. The joke was tn. on me]; Don't play jokes [b. on him J. 

c. ^(D%mt [b. ^1 iii]it^>n/-ct<7)^c; 

Moreover, Benefactive may be generalized to include the following cases. 

(37) a John] | suffered a stroke / underwent an operation ) last night. 

b. fm\ nr^nfeh I'i'TM/Brrjji n 

(38) a. [„, Mary] i had/ got ) her arm broken by accident, 
b. tn, ^/J^«f-^'l^#Wr7o 

(vii) Instrument (In): the thing, tool, device or means used by Agent, typically 
an inanimate NP (concrete (tool) or abstract (means)); often used as adverbials 
and introduced by such adpositions as 'with (tool), by (means); ffl (tools, 
means). , (transportation); 7". In the absence of Agent, however. 
Instrument may be promoted to become the subject (as in (42)). 

(39) a. John crushed the piggybank [,„ with a hammer]. 

b. 'm t,„ mm mimkn 

(UO) a. John got the money from Mary ti„ by a trick]. 

b. 'm [,„ ui^iii-i m-^^ K)\m.) Mrtio 
,o BEST COPY AVWLABLi. 



(^11) a. Mary went to Boston [,„ by { plane/car/sea 1 ]. 

c. ^^j-ii [,„ {TIl^f^/^|f./)^{n ti Wy.vy^u >tz. 

(42) a. [,„ John's hammer] crushed the piggybank. 

(vlil) Location (Lo): the location or spatial orientation of the state or action 
Identified by the verb, typically a locative NP, often occurring as complements 
or as adverblals and Introduced by such adposltlons as 'at, In, on under 
beside, across . ; lir... ( (ffO) / [■ (fjj}) ^^^^y /Tfilk/m/ -] : (H 

\^^/ ±/V/m/^'\t^\'^/--] UC (complement) /"e (adverbial)} '. 

(43) a. He Is studying [l„ at the library]. 

c. mt [to mm^-^] m^Lx\^^, 

(44) a. She stayed [l„ In the room] alone. 

b. m-mKm [..a ^mvm (M) i . 

(45) a. • John put the pistol (,,„ on the table]. 

c. Mmt'xh [,.o y--yji (D±iz ] m\,^tz„ 

(46) a. It Is very noisy [l„ in the city] ; 

[,,„ The city] is very noisy, 
b. [to TOMl 'i^'irm. 

(ix) Time (Ti): the time or temporal coordinate of the state or action 
identified by the verb, typically a temporal NP, often occurring as adverblals 
and Introduced by such adposltlons as 'at, in, on, during, before, after, . ; 

{mm/-^?\'/mi/Pm/-] . tj^-, m/m iz'. "Bare-W time adverbs 
such as 'today, tomorrow, day after tomorrow, yesterday, day before yesterday, 
{next/ last} { week/ month/ year} ; {^/nn/m/^./m\ ^. iT/.h} (PW 

)i\ . . however, may occur witliout adposltlons or even as subjects 

(as in (51)) or complements (as in (52)). 

(47) a. They arrived [t, at 10] and departed [t, at 10:30]. 

b. <te [t, (p i'WMVmm, [t, m -hm-] mm. 

f'^^^Jll 1 01^/^] Si]«L, "Tx, 1 OiH^tc] (Cd^) muz. 

(48) a. Mary set the d?te f7, for Monday]. 

b. n^mmBimr [x. ^sj^j^To 

(49) a. Edison was born [tt in 7847] and died [x, in 1931] 
b. { [x, ^l847^r|jB4, [x, ^1931^.] mm/ 

m [t, ^18W^] . ^]ti: [„ |^l93t^] } , 

(50) a. I met John [yi yesterday]. — 

b. 5fe [x, ni-:,^] mn^mo 

(51) a. [it Tomorrow] will be another day. 

b- [x. nfl^] xji-Mrt^jcrr-o 

(52) a. My birthday was [xi day before yesterday ]. 

b. mjikBMi [x. m^] o 

r.. %(Dm^:i'.B\ii [x. -iKplHl -eL/>:. 
X) Quantity (Qu): the generalized range or an "arch-role", which includes such 
allo-roles" as "number" (Qn, as in (53) and (54)), "duration" (Qd. as ^55) 
through (5-7) ). "cost" (Qc, as in (58) through (60), "length" (Ql), as in (61) 
through (62), "weight" (Qw), as in (64), "volume" (Qv), as in (65) 
frequency" (Qf) and "measure" (Qm), as in (66) and (67); mostly consisting of a 
quantiricational phrase (QP. i.e. a noun phrase containing a quantifier (Q)) 
and occurring as an adjunct (often introduced by the preposition 'for' In 
English). Hlth certain predicate verbs, however. Quantity may occur as subject 
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or topic (as In (51). (5lc) and (57a)), object (as in (53) . (5'la . b) and (59a,b)). 
or complenient (as in (56), (60a.b) and (62) through (65)). 
(53) a. This hotel can acconimodate [ g,. five hundred guests). 

(51) a. This large dinner table can seat [q,, twenty persons]. 

b. M ■'H^k&i^^^iU'^ Un ..-.I -PA! 

(55) a. We studied (English) [qh for two hours]. 

b. mi mmv;) mj lo. m m) «] . 

(56) a. The conference lasted [q^ two hours]. 

b. tmimj lo. mm) . 

.c. ^Sti tod 2m^] l^l^/^o 

(57) a. [od Ten years] have elapsed since my son left. 

b. ^7;fe7MBM7 to- +^1 7o 

(58) a. I bought the book [g, for fifty dollars]. 

b. 5fe [qc miiMm. imim 1 wrm^^n 

c. mi Uc r> 0 lOb (com) C(D7t5:^R-^/c„ 

(59) a. I paid [qc fifty dollars) for the book. 

b. ^mjm^m^T loc rimm<^.] . 

(60) a. The book cost me [q^ fifty dollars]. 

b. ii**?E7ife toe B.-m^^] o 

c. CtDTfctmJC [q, 5 0 K;b (^) ] ^^'^-ti-^c; 
|lx{iC(7)7i5:{c [q. f) 0 lOb (^) ] ^A^'L^^io 

(61) a. The boat measures [g, 20 feet]. 

c. l^(7)g:${i [q, 2 0 7^-1^] /d"; 

Cco-K- h{i|l:$/)< toi 2 0 7 ^ - h] ifc'So 

(62) a. The forest stretched [g, for miles]. 

b. M'-mmmi^ u. iimm^rr 

(63) a. John stands [g, six feet]. 

b. «M to, /V^Rl: [q. A5iK] i^o 

c. M(DMmi. to. 6 7 ^ - M ^c; 
A6R{ia-g;&< to. 6 7 ^ - M *-5o 

(64) a. Mary weighs [qw one hundred pounds]. 

b. /jN^fiSfi to. loofigl; 'hOT Iqw 1W1 «o 

c. ^Ei^cDMti to. ioo:i?>Kl ti; to, iooH?>K] *^>o 

(65) a. The cell measured [q, [q, eight feet] by tqi five] [g, eight] high]. 

b. mmmm (W) to. to. a^ri to. h^ri 

[q. AlltR] i^l o 

c. -tcDfii^Mti to. to. 8 7^-M.^$to. 5 7^-1^1. 

to. 8 7^- 1-]] ^-So 

(66) a. He meet [q( [o« twice] a week]. 

b. M to, mmm] m\ to. mj^i; to. ^s)wi ^ to. ffiio 

c. Mti to. tSiSI to. 2|£] ^-Ti^So 

(67) a. They dine together to( every three days]. 

b. {ib'tf^io. nzvK] fc« to- -;^] o 

c. to. 3Eifc#jc] torn -m -'m■^'k^l^Lr\.^6o 

(xi) Proposition (Po): the "arch-role" which consists of the subject and the 
predicate expressing a state, an event or an action. English propositions can 
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be divided into semantic types such as ( i ) "declarative" (Pd), ( ii ) 
"interrogative" (Pq) [5] and (Hi) "exclamatory" (Px), or syntactic types such as 
( 1 ) "finite clauses" (Pf), ( ii ) "infinitival clauses with the complement izei- 
'for'" (Pi). ( iii ) "infinitival clauses without the complementizer 'for'" (Pi), 
( iv ) "gerundive clauses with the genitive subject" (Pg), (v) "gerundTve 
clauses with the accusative subject" { ?g) , ( vi) "finite clauses with the past- 
tense verb" (Pp). ( vii) "finite clauses with the root-form verb" (Pr), ( viii ) 
"infinitival (clauses) with an empty subject" (Pe), ( ix ) "participial or 
gerundive (clauses) with an empty subject" (Pe), and (x) "small clauses" (Ps). 
With Chinese and Japanese, however, no such elaborate subclassif ication of 
syntactic types is necessary, and we need only to identify four proposition 
types, namely, Pd, Pq, Px and Pe. 

(68) a. I know (p, ( that ) John is a nice boy]. 

b. mm [p. ^7%!ivj]m o 

(69) a. She whispered [j^ * (that) she had secretly bought the car]. 

b. mmf (Ji!i) ^ii, [p. mmmmm 

(70) a. 1 asked Mary (p, (whether / if} she knew the answer ]. 

b. mm<r\i: [p, m iM m m 'gmrm ., 

(Tp-a. l^e don't know [p, ( whether (or not) /when /where / how | we should go]. 

(72) a. l^e don't knowTr, ( whether (or not) / when/where/how | PRO to go ] . 

b. m^mm pro \mm./\AMt&'^</^M{[]mR/i^<.sM\ ik\ 

c. #Aii{i [p, PRO ( ^ /-c L T/ 1 1 z.^ / EnK^ni^n \ z (Lt) 1 
fr< 'm^tiW. 

(73) a. Could you tell us ( [p, what we should/ [p, what PRO toj do] 7 

b. f^rngf'fiDM { [„, ?iir]/T7; PRo"~rMWMli ? 

(71) a. I didn't know [p, what a smart girl Mary is]. 

b. nm^nrM [p, (Ef^j mmfi^mMrkmj-] „ 

c. [[p. ^tWc^^^ic|/^/;<^^^ (ta ] titrY\hU^}^-^tz„ 

(75) a. They never imagined [77 how very smart she is]. 

c. [(p. mWCAJl}zm<^] iii) (.WizhTmh-^^f}--^tz. 

(76) a. He consider {T77 that Shakespeare is/ [p^ Shakespeare to be / 

[p, Shakespeare a great poet). 

b. [p. feltiS^ji^AWf^Al o 

(77) a. John expects ( [p, that Mary will/ [p, Mary to) succeed ] 

(78) a. John wanted {(it) very much [p, for Mary/ (p, Mary 1 to succeed]. 



5. Interrogative clauses can be further subdivided into "finite interrogatives" 
(Pq) and "infinitival interrogatives" (Pq). Declarative clauses can also be 
subclassified into those in which the complementizer 'that' may be optionally 
deleted (Pf) and those in which tlie complementizer 'that' may not be deleted 
( Pf; e.g. the complementizer 'that' following "manner of speaking" verbs such 
as 'shout, .scream, shriek, mumble, mistter. whine, lisp, whisper, murmur, quip. 

'6. He can al.so say '-f/o^U. (p, {^JMn (^V-J:.lr) ( j < ^ /;>cl: ') ^1 %'i\^'>ti 
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(79) a. Do you mind | [p, my/ [ p, mel wearing your necktie]'? 

h. mwMrm ^<iim'^ 

c. (Ird di/;i/r.c;n^ r ^ oftTl Jj] l^+ 

(80) a. I wish [pp I were a bird]. 

h. men [p., teWi^i „ 

(8)) a. John insisted [n, that Mary jbe/ stayl here with him]. 

(82) a. They found { [p, that the place was / [ p , the place to be / 

[ p , the place <f> ] deserted ] . 

b. m'i^^ [pd jimmh'-^MAm . 

c. W^\t [[pd ^coii;Tr^c^5^Hl^/il^] c<!:^ ] fgBL/c,„ 

(83) a. John saw [p, ( Mary / her ] 6 walk into the restaurant]. 

b. >mM^ [pd \'\<W7nn^-mm . 

(84) a. John tried [p, PRO to reach Mary]. 

b. /jNiinriri (p. PRO (io lui^'bm 

c. i:g|5{± [[p, PRO n'r\zm^L^] t] U^i-fz^ 

(85) a. John forced Mary [p. PRO to marry hiiiiT. 

b. /]Nn;iiSiie/iNm tp. pro mmm] n 

(86) a. John promised Mary [p, PRO to marry heFT 

b. 'bmrm'H^- [pc pro mmnm 

In the above discussion, we have postulated and identified a set of tlieta-roles 
which seem to be necessary for contrastive analysis of English, Chinese and 
Japanese, based on such formal criteria as (i) the Principle of One-Instance- 
per-Clause (i.e. only one instance of each theta-role may occur in a simple 
clause [7], (ii) the Principle of Complementary Distribution (i.e. those 
arguments which are in complementary distribution must fall under the same 
theta-role), (iii) the Principle of Conjoitiability (i.e. only arguments which 
fall under the same theta-role can be conjoined, and (iv) the Principle of 
Comparability (i.e. only arguments which fall under the same theta-role can be 
compared). Our postulated theta-roles are perhaps more concrete than those 
proposed by other scholars, because we have taken into consideration not only 
the semantic content of theta-roles but also their distribution in terms of 
syntactic function, canonical structure realization in terms of syntactic 
category, and when they occur with adpositions (including prepositions and 
postpositions), unmarked manifestations of these adpositions. Thus, our 
postulated theta-roles ( ( i ) through (xi)), syntactic functions into which 
they may enter, syntactic categories in which they may occur, and adpositions 
with which they may typically co-occur are summarized as follows. 



7. This principle, however, does not prohibit the same theta-role from occurring 
as obligatory and optional arguments in a simple clause. In the following 
sentences, for example, two instances of Location and Benefactive occur, one 
as semantic argument (i.e. adjunct) and another as indirect interna' argument 
(i.e. complement). 

(i) a. (While) [,.„ in the classroom ] Mary placed the flowers (,.„ on the 

tGschcr' s d6sk ] 

b.i,.„ ^mMr^bwum mm . 7-± ] . 

(ii) a. John bought a .wristwatch [n, for Hary][n. on behalf of her mother]. 

b. /jMiji (t vifti 'hmmw ffr '^r- h.'WhWV ^ 
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(87) 
( i ) Agrnt 



(Ag) 

( ii ) Experiencer (Ex) 



\t m. *ft; (C) 



( iii ) Theme 



( iv) Goal 



(Th] 



(Co); 



to. 



( V ) Source 



(So): 



'from (unmarked), since (temporal); 
(locative), {g/tfl (temporal); 



( vi ) Benefactive (Be); 



(a) subject (animate NP); 

(b) oblique object (PP; 'by; 

(a) subject (animate NP); 

(b) object (animate NP) ; 

(c) oblique object (PP; 'by; m, {^; ) . 

(a) subject (animate or inanimate NP): 

(b) object (animate or inanimate NP); 

(c) oblique object (PP; 'of; |12, ff; 

(a) complement (PP; 'to (unmarked), into, onto 
(locative), till, until, through (temporal); 
jf^ (animate recipient), SlJ (locative or temporal 
end-point); {C (unmarked), ^ (locative), 

(temporal)'; NF (when preposed in English and 
Chinese ditransitive constructions)); 

(b) adjunct (PP; adpositions (the same as those in 
complements) ) ; 

(c) subject (NP); 

(d) object (NP). 

(a) adjunct (PP; 
^JSl^ (unmarked), 
^^^,'); 

(b) subject (NP); 

(c) object (NP). 

(a) complement (PP with animate. object NP; 'for; i^; 
}C'); NP (when preposed in English ditransitive 
constructions, animate); 

(b) adjunct (PP with aminate object NP; 'for, 
on behalf of; ^, ^. JgJ; tC, (DtMZ'); 

(c) subject (animate NP). 

(a) adjunct (PP with inanimate object NP; 'with (tool), 
by (means); j]] (tool, means), f^, ^ (transportation); 

-e'): 

(b) subject (inanimate NP). 

(a) comple.nent and adjunct (PP with locative object NP; 
'at, in, on, under, beside, across, •••; 

( m) /± (U) /T (iS) /'^»/^m/-) ); 

<7y \K (comlement)/ 

-e (adjunct)) '); 

(b) subjsct (locative NP). 

(a) adjunct (PP with temporal object NP; 'at, in, on, 
during, before, after, ...; i^- m\^^/-^'^/m/ 

; l^/m/'ik) (C; NP (bare-NP temporal adverb) ; 

(b) .subject (temporal NP). 

(a) complement (QP); 

(b) adjunct (PP with QP as object; 'for'; QP; QP); 

(c) subject (QP); 

(d) object (QP). 

(a) subject (Pd, Pq, Pq, Px, Pf, Pi, Pg, Pr, Pe, Pe, 
Pd, Pq, Pq, Px, Pe; Pd, Pq, Pq, Px, Pe); 

(b) object (Pd, Pd, Pq, Pg, Px, Pf; Pi, Pi, Pg, Pg, Pe, Pe 
Pp, Pr, Ps; Pd, Pq, ?g, Px, Pe; Pd, Pq, Pq, Px, Pe); 

(c) complement (Pd, Pq, P£, Px, ff, Pi, PI, Pe, Pe, Pr; 
Pd, Pq, Pq, Pe; Pd, Pq, Pc[, Pe); 

(d) adjunct TPi , Pi, Pg, Pg, Ps (English)). 

The eleven theta-roles listed above are essential hut by no means exhaustive, 
and additional theta-roles may be proposed on empirical grounds. English verbs 
occurring in (88) through (90), for example, must take as complement Manner 
(Ma), which may he adverbs or prepositional pitrases introduced hy 'with'. In 



(vii) Instrument (In); 



(vi) Locative (Lo) : 



(ix) Time 



(Ti) 



( x) Quantity (Qu): 



(xi) Proposition (Pr): 
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the correspoiiHing Chinese and Japanese verKs, however, Manner may occur either 
as complements (as in (88h) and (89b)) or as adjuncts (as in (90b), (88c), 
(89c) and (gOc)). 

(88) a. He behaved ( [m„ badllvl to his wife/ [„, with great courage] I . 

b. {Ill mkk (mm) [m... ium ]/Mm [«,. mm \ .. 

(89) a. She always treated us [m„ |well/ with the utmast courtesy) 

(90) a. I {phrased/ worded } my excuse [m„ {carefully/ with care} ]. 

Theta-roles such as Cause (Ca), Result (Re) and Condition (Co) may also be 
proposed to specify various adverbials or adjuncts. These theta-roles, moreover, 
may also occur as subjects or objects. Cause in (92), as opposed to Instrument 
in (91), for example, may occur as subject and account for the difference in 
syntactic behavior between these two theta-roles. Likewise, Result occurring as 
object in (9't) and Theme occurring as object in (93) are differentiated by their 
syntactic behaviors in p.seudo-cleft sentences in English and passive sentences 
in Chinese and Japanese. Compare. 

(91) a. John burned down the house [i„ with fire]; 

The house was burned down by John [,„ with fire]. 

b. [.n m/iM 'X] mmm-T-< m-m'^ [,„ im/m 'M m^j. 

(92) a. [c.i A fire] burned down the house; 

The house was burned down [c, by a fire]. 

b. [c, -^'Xf^] mwrm-T;, 
mr [c. ^-iwx'umm] . 

c. [r„ XW) W-^^iU^Xl-t-^l:. ; 
W [ c. X 'IfTl jtJ'£( t X L * -. 

(93) a. They finally destroyed [ji, the house]. 

[jh The house] was finally destroyed (by them); 
What they did to [n, the house] was destroy it. 

b. mmmmi m-r-]\ 

[th M-r-i mm nmv mr. 

c. ^Jfi^Ji'-Jl^K [ti. ^^] tiOCi-)Lti; 

(94) a. They finally built [co the house]; 

[go The house] was finally built (by them); 
*What they did to [co the house] was build it. 

b. {liiWA^mT [co m-T); 

c. ^«^^,{iol^^c [co ^^] mxfz: 

* [r, ^(i] oi>{c mhiz) mxhfifzo 

Note also that Manner, Cause, Result and Condition (Co) may occur as semantic 
arguments or adverbials in the form of adverbs (e.g. 'A-ly ; A Xji ; A < '), 
adpositional phrases introduced by adpositions (e.g. 'with (care), in (peace); 

El (A mw. ill (NP (-) m . (NP) -m) ; ( mm/m^/ np cd^ 

n] tC, ( {%^^/^■t^:>^/L'^■/}^] ) t, (NP O) in<'; 'for (the sake of), 
because of, owing to. wltli a view to, from (thirst), of (hunger), in order to 
(VP), so as to (VP) ; UgT/lllUVmi NP. \\>m/m^/nm) VP; (NP) cntl 
Hz), (NP) X, (V(i)) {C,(A Or'; 'in {case/the event} of or subordinate 
clauses introduced by subordinate conjunctions (e.g. ' ( as/ as if/ like) S; 
( I^W/%} ) S ( -KBi); S \f)<ia]< /:h^(r>J:')iZ\ '; ' ( for/ because/ as/ 
since/ now that) S; I WlH/ll l/i-'/'4r,fl|l S; S {(PiX /ti-C^/^A./ t:.i^^ ( tC) ) : 
'so.jA /Ad ) that/ so much so that/ such that) S; S; ••• VM^X /(J^X 

\ ■; ' jif /in case (that)/ provided (that)/ in the event that / unless) S; 
/'»/W/flyrf/^ (^) ^/nm S; ( |V-(r)u/V-(a)/ilx} t. ( (V-(Ta)/V- 
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(a)^j7!)>o/-c) ) h, ({V-(r)e/V-(a)n(aker)e)) Cf. ( {V-(r)u/V- (a)rj: t,^) ) ^ ' ). 
Moreover. adpositions (including prepositions and postpositions) and 
conjunctions can be analyzed as two-ter« predicates, with adpositions taking 
NPs as coaplenents and various aaxiaal projections (e.g. VPs, IPs) as 
specifiers, subordinate conjunctions taking subordinate clauses as coapleaents 
and principal clauses as specifiers, and coordinate conjunctions taking 
coordinate aaxiaal projections (e.g. NPs. VPs, APs, AdPs, IPs, CPs) as 
conjuncts (or, alternatively, as coapleaents and specifiers In a revised 
version of the X-bar Convention to be dlcussed in section 5). 

4. The Theta-grld: its Contents and Foraallzatlon 

As discussed above, the syntactic properties of predicate verbs, adjectives and 
nouns Include arguaent and theoatic properties of these predicates, along with 
categorlal features and syntactic functions of their accoapanying arguoents. Now our 
task is how to register these syntactic properties in the fora of theta-grids as 
siaply and explicitly as possible, so that they can be projected into sentences in as 
econoaical and strai^tforward a aanner as possible. 

(i) In principle, only obligatory arguaents (that is. internal arguaents. including 
direct internal arguaents («■ objects), indirect Internal arguaents (» indirect objects 
or coapleaents) and external arguaents (■ subjects)) will be registered in the 
theta-grld. and seaantlc arguaents (that is, adjectival and adverbial adjuncts) will 
be handled by lexical redundancy rules such as (95): 

(95) a. [. . . Xx] ~> [. . . In, Ma, So, Go, Lo, Tl, Re, Xx] 

John studied English. — > John studied English diligently at the library 
yesterday for today's exaal nation . 

b. [. . . Xx] — > [. .. Re, Tl, Lo, So, Go, In. Ma, Xx] 

'j^a^i*^. --> 'm mraMj^^gfi^i^ maam. s&m ae^. 

c. <. .. Xx> — > <. ..Re, Tl, Lo. So, Go, In. Ma. Xx>[8] 

The lexical redundancy rule (95) states that seaantic arguaents or adverbial adjuncts 
such as In(struaent), Ma(nner). So(urce), Go(al), Lo(cation), Ti(ae). Re(ason) should 
be inserted between the external arguaent 'Xx' and the rest of arguaents '. . . ' (in the 
case of absolute intransitive verbs which contain only external arguaents in their 
theta-grids, these adjuncts siaply follow the external arguaent) in the order given. 
The actual positioning of these adjuncts in surface sentences will be decided by the 
Head Paraaeter for each language. In English, which is basically head-initial in 
phrasal constructions, adverbial adjuncts follow the predicate verb and internal 
arguaents, while in Chinese and Japanese, which are basically head-final in phrasal 
constructions, adverbial adjuncts precede the predicate verb and Internal arguaents. 
As for adverbial adjuncts that are idiosyncratic in occurrence and distribution, their 
Idlosyncracles will be specified in the theta-grids of these adjuncts (see the 
relevant discussion in (xlii) below). The distinction between obligatory versus 
optional arguaents is i^x>rtant because obligatory arguaents behave differently froa 
optional ones in teras of the position they aay occupy in a sentence and the 
adposltlon they aay select. Benef actives in Chinese, for exaiq>le, aust take the 
preposition 'J&' when they occur postverbally as obligatory arguaents, but aay take 
either '^&' , '#' or 'M' "hen they occur preverbally as optional adjuncts, as 
illustrated in (96). Locatives in Japanese take the postposition when occurring 
as Internal arguaents but take 'T?' when occurring as seaantic arguaents, as 
illustrated in (97): 

T-W [Be .]._. 



Be { /m r^j'm ] iSr7-m 

U [Lo ]1ffi^T'l^S. 

u [Lo im.lL ] m^xi^?>. 




8. The angle brackets ('<...>') for Ure Japanese thota-grid indicate that the order of 
the arguaent listed aay be scraabled in surface sentences. 
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(ii) There are verbs that can be used both transitively and intransitively. Ergative 
verbs. for exaaple. can be used as inchoative-intransitive as well as 
causative-transitive verbs. In which case the external argument Agent of the ergative 
verb 'open; B8< is placed in the parentheses ('(Xx)') along with the internal 
argument Theae. as in '[Th (Ag)]', to indicate the coabination of the 
causative-transitive '[Th, Ag]' and the inchoative-intransitive '[ Th ]'. Thus, the 
theta-grid '[ Th (Ag)]' will project to yield the causative-transitive (98) and 
inchoative- intransitive (99) sentences. 

(98) a. La, John ] opend hh the door ] slowly. 

b. [Ag 'm ] ®{g4 MT [n PI ]. 

c. [Ag^fcfiPtt ]ip'o<'^t [th K7^& 1 

(99) a. [th The door ] opened slowly. 

b. [a PI ] SSife MT. 

c. [th FT'tt ] f^-p< m^tz. 

Certain transitive verbs (e.g. 'finish; j/ti^. 'tH'^^/'i')' ) "ay optionally delete 
their objects in surface sentences, in which case the internal argument aay be placed 
in the parentheses (e.g. [(Th) Ag]) to indicate its optionality and will project 
into sentences such as (100): 

(100) a. Have you already finished ([ih your hoaework ])? 

b. ifs^Em^j ([th mjmj) 

c ([Th SSfiMM)]) mi:{^/b]tz<D7[^<^] 

Thus, the English verbs 'eat. dine, devour' are distinguished in their theta-grids. 
respectively, as '[(Th) Ag]'. '[ Ag ]'["] and '[Th, Ag]', which will yield sentences 
such as (101) : 

(101) a. What ti»e do we gal ([Th dinner ])? 
b. What tiae do we dine? 

a The lion devour ed~ TTh the deer ]). 

(iil) There are also ditransitive verbs that take two internal arguaets (i.e. 
objects), direct and indirect, either of which say be optionally deleted. In this 
case, 'linked parentheses' ('(XxXYy)') aay be used to indicate an optional choice of 
either 'Xx' or 'Yy' . Moreover, with aany English and Chinesse ditransitive verbs, 
indirect objects lay either follow or precede direct objects in surface sentences. 
This optional penutation between direct and indirect internal arguments can be 
represented in their theta-grids by the use of angle brackets ( '<Xx, Yy>' )Li^. Thus, 
ditransitive verbs such as 'send; jg; 3^5', 'teach; i^; and 'ask; T^; |!0<' 

■ay have the theta-grids '[<Th (Go)> Ag] [«] ; [<Th ((|d)Go)> Ag] ; <Th (Go) Ag>', '[<(Th 



9. While the English verbs 'open; close, shut' and the Chinese verbs 'BB. JlM; M> BB 
W are all ergative verbs, aaong the Japanese verbs 'M(0'^)<. MltS. §fl(*)< : M 
05, only 'm^^X' and 'EflC'S' are ergatives. and 'MltS: 

and 'M(*>)<. W^rS' are transitives and intransitives, respectively. 

10. Many Japanese, however, seea to prefer using the intransitive verb 'lAts' , as 
illustrated in '^ 9!&Jgy:3lA-/"C<D?' . 

11. In addition to '[ Ag ]', the verb 'dine' also has the theta-grid '[Qu, Lo]', which 
yields sentences like '[Lo This table ] can dine [Qu twelve persons ]' and '[Qu How 
•any people ] can [Lo this restaurant ] dine?'. The theta-grids '[ Ag ]' and '[Qu, 
Lo]' can be coabined into one by the use of curly brackets ('{Xx/Yy)'); naaely, 
'[{Ag/Qu. Lo}]'. 

12. Since all the arguments in Japanese, obligatory or otherwise, can be scraabled or 
perauted, which Is Indicated in the theta-grid by the use of angle brackets rather 
than that of square brackets, there is no need to insert angle brackets within the 
angle brackets. 
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lGo)> Ag]; [(GolTh) Ag];<Th (Go) Ag>' and *[<(ThIof Go)> Ag ][^*]: [(GolTh) Ag] ; <Th 
(Go) Ag>', respectively, yielding sentences such as (102) through (104): 

(102) a John sent {[th a box of chocolates ] ([o, to Mary ])/ [oo >tery ] 

[ill a box or chocolates ]). 

b. m7 ([th -iirP5g.^« ] ([go mM])/ loo m^m i 

[a -^T^^.m ]}. 

c. {[n. ^33l^-hi-m ] [a, iEi^C ])/ [oo ^.^IZ ] 
[n. ^33U-hi-^ ]] m-^tz, 

(103) & Who is teaching (([ih Engilish ]) [qd to your brother ]/ 

[go your brother ] ([m English ])/ [th English ]}? 
t {[a> ] [th ]/ [co ]/ [Th D? 

c. mif ([o, m<D^iZ ]) [Th ] «tPtTl>5(0? 

(104) a te asked {[m a question ] ([so of Mr. Lee ])/ 

[so Mr. Lee ] ([tt a question ])}. 

t m\ PpIT {[go ] [th -m\m m i[oo i/ tth -mm ii. 

c. 1^*5 ([o, ^i-^^z ]) [th Hfii. 1 Mi^/-c. 

(iv) Predicate verbs that have sore than one surface realization can be so indicated 
in their theta-grids by the use of curly braces and angle brackets. Verbs like 'blaae; 

jfei^S' and 'load; $|(ic): (i: 2r*?)^tf '. which take two pemutable NP and PP 
coi^leaents in English, for exaaple, will have the theta-grids 't(Be for Ca/ Ca, on 
Be) Ag]; [Be. Ca. Ag] ; <Be, Ca, Ag>' and '[{Th. Lo/ Lo, with Th) Ag] ; [Lo, Th. Ag] ; 
<Th. Lo, Ag>'. respectively, yielding sentences such as (105) and (106): 

(105) a John blaaed (Lta Mary ] [c, for the accident ]/ [c« the accident ] 

[bb on Mary ] ). 
t 'J^ [c %,lMm ] (ifn)glS [Be 1. 
c. [c %Wl<D:L}iX' 1 ] 

(106) a John loaded {[th the furniture 1 [lo (on/ onto/ into ) the truck ]/ 

[lo the truck ]/ [n with the furniture ]). [15] 

t >m {[ih fim ] « [u a-^^ {h/g)iin ]/ [u ^-^^ { H./g)s ]) mki. 

c. [lo h^y:»(0(h/4^)tc ] [n ] (H-:>^ *)) 

Siillarly. verbs like 'talk; ^(ift); i§f' and 'hear; i^J: HB< '. which take two 
PP-co»ple«ents that can be permuted in English, will have the theta-grids '[<Go. about 
Th>AgJ; [(^)Thfl<j:(^t*. ffiCo. Ag]: <Th((OC: t)^Co^^T. Got. Ag>] ' and '[<So. about 
Th> Ex] [16]; [(:^)Thfi^S. So. Ex]; <Th(OC:t. So. Ex>'. respectively, yielding 
sentences such as (107) and (108): 



13. The underline under a theta-role ( *Xx' ) denotes tnat this theta-role Is inherently 
Cas8-«arked and thus does not require a structural Case assigned by a transitive verb 
or a preposition. 

14. The filing-out of the Goal preposition 'of' in the theta-grid for the English 
verb 'ask' indicates that this is a "larked' representation of the Goal prepositioa 
whose umuirked nanifestation is 'to'. 

15. There is a difference in interpretatioa however, between 'John loaded the 
furniture on the truck' (partitive reading) and 'John loaded the truck with the 
furniture' (holistic reading). The holistic reading is conveyed in Chinese and 
Japanese, not by the difference in word order, but by such adverbial expressions as 

and 'Ho^»)(»[tf)'. 

16. The Ehglish verb 'hear' has, in addition, the theta-grld '[{Th, So/ So. Pd) Ex]', 
corresponding to the Chinese verb 'IgSJ Th. WSi W ([{Th/ So Ig Pd) Ex]) and the 
Japanese verb '|JH<' « {Th/ PdCfc) So, Ex>), yielding sentences such as 'I heard {the 
news frca hia/ froa hia that his wife was ill): ^^(ti^ftbdHSlDli^lillii^A./ Kf^flfa^ 

-kkmi) . t^nttA><b {com./ teo#A<j|««T*ec:fc)«Mi>/-c'. 
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(107) a. John talked {[go to Mary ] about the party ]/ 

[m about the party ] [o, to Mary ]). 

b. [go m-m ] ^ [n, i^)mmtm i. 

c. loo ^E^lC ] [xh tmL(DZti ] mVtz. 

(108) a I heard {[so froa hi« ] [n, about the accident ]/ 

[ih about the accident ] [so fro« hi« ]). 

b. m [so 'iimmm) ] mi mi)^mm!t i. 

c m {[m mL<r>zt^ ] Iso m^i^ ]/ [so ] [th $t^<DZt=^ ] 

(v) Though we strictly observe the 'One-Instance-per-Clause' Principle, which requires 
that no two identical theta-roles occur as obligatory arguaents within the saae si«ple 
sentence, we will slightly loosen this constraint under the following three sets of 
circuastances. First, two identical theta-roles aay appear in the saae siaple 
sentence, if one occurs as an obligatory arguaent, and the other as a seaantic 
arguaent (i.e. adjunct), as illustrated in (109): 

(109) a [lo In the classrooa ] John sat [ut i n the front row ]. 

c ^yysii [u nt^(04^-e 1 [lo -'^^(Dmiz i ^^ti^s. 

'Syaaetric' predicates, which require seaantically plural suWects (e.g. 'kiss; ({^/ 
ffi})t%; ^V7.i^)-t^'. 'aeet: %iW ■ Ho'. 'consult; ffifi: ffiS^fS') or objects 
(e.g. 'aix; SEIC-Z'B'JS) and function either as transitive or intransitive 

verbs, will be assigned the theU-grid '[{Ag*Ag7Go, Ag) ] ' or ' [ {Th'Th'/Th'Th', Ag} ] ' 
and yield surface sentences like (110) and (111): 

(110) a {[[Ag John ] [aj' and Mary ]] kissed/ hg John ] kissed [co Mary ]}. 

b. au ±m ] [a,' m^m u mi/ [a, ±91 1 1^7 ico ±m 1. 

c. {[[Ag imt ] [Ag- i££ ] 1/ [a, iMil ] [co te-3^tc ]} ^-J7.btz. 

(111) a {[[th OH ] [ih' and water ]] won't aix/ You can't aix [a oil ] 

[ih' ( and/with) water ]). 

b. {[[n. tk] [rt' ]] ^U-^/ f^feS-^ [n. Ml [th' ffii 1). 

c. [[th 2ic^ 1 [t!.' Ml 1 « ]{ ut^^m^/ m^^zta^m^ui^ ). 

Finally, cerUin predicate verbs (e. g. 'outrun; ^SS^W&S^it- ■ ■ ■ J:*0^<M^', 

•puttalk: imm/mit,.{nm-. ... j:»jui¥c/^<i|§-r'. 'outshoot; mwm 

it--.^' . . . i*) jElfl^it'O') that seea to require identical theta-roles as both 
subject and object in their seaantic interpretation will be assigned the theta-grid 
'[Th, Th']' and yield a surface sentence like (112): 

(112) a [t» John ] can always outrun [th' Bi 11 ]. 

b. [th M ] ttAfe { mm [Th' dm ]/ [m- itym ] i. 

c. [ih 1 ^^-ot [ih- Mi*L 1 ¥<^*. 

(vi) We also strictly observe the Theta Criterion, which requires that the 
relationship between arguaents and theta-roles be one of one-to-one correspondence; 
naaely, each arguaent is assigned one and only one theta-role, and each theta-role is 
assigned to one and only one arguaent. When a certain arguaent can be interpreted as 
playing aore than one theta-role, however, we will indicate this lexical aabiguity in 
the theta-grid by the use of curly brackets and a coaaa (i.e. ' {Xx, Yy) '[^7]). Thus, 
the verbs 'roll; (fl)^; 4I^^<9$$'^) ^ ', for exaaple, will be assigned the 



17. The use of curly brackets and a coiaa ('{Xx. Yy) ') differs froa the use of curly 
brackets and a slash ('{Xx/ Yy')) while the foraer signifies that a certain arguaent 
■ay be interpreted as playing the theta-role 'Yy' as well as the theta-role 'Xx', the 
latter indicates an optional choice of either the theta-role 'Xx' or 'Yy', but not 
both. 
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theta-grids '[{Ag. Th) (Ro)]; [{Ag. Th) (Ro)]; < {Ag. Th) (Ro)>'[i8l and yield a 
sentence like (113). in which the subject NP 'John; 'j^: :k^' say receive the 
semantic interpretation of either (voluntary) Agent or (involuntary) Theie: 

(113) a [Ag.Th John ] rolled down the hill. 

Sinilarly. the verbs 'buy; M; ' and 'sell; W; ' will be assigned the 
theta-roles '[Th (So) {Ag. Go)]; [Th (So) {Ag. Go)]; <Th (So) {Ag. Go}>' and ' [Th (Go) 
{Ag. So)]; [Th (Go) {Ag. Go)]; <Th (So) {Ag. So}>'. respectively, which will yield 
sentences like (114). where the subject NPs 'John; 'J'^; receive the Goal as 

well as Agent interpretatioa and (115). where the subject NPs 'Mary; /J^^: 
receive the Source as well as Agent interpretation: 

(114) a. [Ag.co John ] bought soae used books fro* Mary. 

c. [Afrco -iMifi 1 tm n-otz. 

(115) a. [AfrSo Mary ] sold soae used books to John. 

b. [Afcso ] mi-^mm^ 

c. [Afrso ^.-^^ ] ticm ^fcfistc m-otz. 

(vii) In order not only to optiaize the nuaber of theta-roles available in our 
analysis but also to siaplify the selectional restrictions between theta-roles and 
adpositions. we have left unspecified the adposltitions that co-occur with various 
theta-roles in unaarked cases, but specified in the theta-grids those co-occurring 
with various theta-roles in aarked cases. Thus, the Source preposition 'froa' in the 
theta-role ' [Th (So) Ag] ' for the English verb 'steal', or the These preposition 'of 
in the theta-role '[So (Th) Ag] ' for the English verb 'rob', will be left unspecified, 
yielding sentences such as (115a) and (117a) [1^]. while the aarked selection of the 
Source preposition 'of for the English verb 'ask', and the Theae preposition 'for' 
for the English verb 'beg', aust be specified in their theta-grids as '[<Th. of So> Ag 
]' and '[So. for Th. Ag] '. respectively, yielding sentences such as (118a) and 
(119a) [20]: 

(116) a. The clerk stole [m the aoney ] [so froa the cash register j. 

b. umm [so mm. ]mj It^b i 

c. [so bi»g> ] [ih ^ ] 

(117) a. The aan robbed [so the bank ] [ih of the aoney ]. 

b mm^A [so mmmi imi i-o^m i 

c. [so iS^t^i^ ] [th ^ ) «-^/-c. 

(118) a. I would like to ask {[th a favor ] [so of you ]/ [so X«i ] [n, a favor ]}. 
b mm [co ^ ] « [th iilt ]. 

c. [qd §ic ] [ih mf^ ] ISi^fzi^ (citA?*>5). 

(119) a. I have coae to beg [so Jfou ] [th for help ]. 

b ^ {[oD « ] mm/ [co ^m) ]) [Th Sit ]. 



18. 'Ro' stands for 'Route'. The verbs 'roll; (B)^: ♦t*»('5^t)'l"' have, in 
addition, the theta-grld '[Th (Ro) Ag] ; [Th (Ro) Ag] ; <Th (Ro) Ag>'. yielding 
sentences such as 'John rolled the rock down the hill; 'J'^^^JCfcl^&^^liFKSff i ; 
li^mi^Wt'^XmH>Omt)btz'. 

19. The Chinese and Japanese verbs corresponding to the English verbs 'steal' and 
'rob' are 'ffe; JStf' and Mo', respectively, both of which have the theta-grid 
'[Th (So) Ag]; <Th (So) Ag>'. yielding sentences like (116b. c) and (117b.c). 

20. The Chinese and Japanese verbs corresponding to the English verbs 'ask' and 'beg' 
or© 'it. (It):^' and 'ig|tp. ', respGctlvely. with the thet«-grids '[Go, Th, Ag] ' 
and '<Th. Go. Ag>'. 
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c. Loo 1 hh mf^ ] **tfc. 

(viii) For predicate verbs which appear in the for. of idio«s or phrases »e assign 
theta-grSs to these idionatic or phrasal verbs. Thus the English verbs and verb 
Jhrasel 'die. pass away, kick the bucket, yield up the ghost pay the debt of nature 
an tn onp's lone account are all assigned the theta-grid L Be J . ana so are xne 
Sin\%e°^'erbs a^l ver? '^^^f' Flf^f*^!^!'"^^ rl?"^^^^ 
and verb phrases 'm. M^T^. ^STS, ^^^^^^J^-^,: *ShSi t? '1^ {uB 
n' Similarly. English phrasal verbs such as give (birth/rise) to. look lup 
to/down) ' and 'tak^.. into (consideration/account)' are assigned, respectively, the 
theta-grids '[Go. So]'. '[Th. Ex]' and '[Th. Ag] . 



(ix) The English expletives 'there' and 'it' appear in 'unaccusative' (including 
Sstentia*;^) senTences and 'i-Persor^l' construct io^^^^^ 




or obligatory external argu.ents in their theta-grids. ini^ T^VraHd ' it T 
verbs such as 'raia snow. hail, thunder' are assigned the theta-grid L it j . 
yielding a sentence like (120a). [21] 

(120) a It is {raining/ snowing/hailing/ thundering). 

English existential verbs (e.g. 'be. exist, live, '•^•ain') and unaccus^^^^^^^^^^ 
(e g 'arrive, occur. e«erge'). on the other hand, are assigned the theta-grids [{Lo. 
Th <tdef>/ Th <-def>. Lo. there)] [22]' and '[(Th <tdef>/ Th <-def> (there)}] 
repectively. yielding sentences like (121a) and (122a)[23]. 

(121) a {[n The bock ] is/ There is [m a book ]) on the desk. 

c. ([ti, (^(O) ] o< 7L(D±iz/ o< ;tcDi:U: [it, ^» (HS)]) W^. 

(122) a ILth The accident ] arose/ [ih An accident ] arose/ There 

arose [th an accident ]) froa carelessness. 

b. {([^ (m!^m.!f\^ ]/ [t* ^^-^^.9^ ]) mm-oi^mmj/ 




k, [Th Mffii^(-^) ]] 



As for English "raising" verbs such as 'seea. appear, happea chance', we will assign 
the theta-grid '[{Pd. it/ Pe. Th)][2*]'. yielding sentences (123a'). as well as 



21. The Chinese and Japanese verbs corresponding to the English verbs '"in; J^^^- 
hail, thunder' are 'TlM/^/S). ftS' and ' ^^^/^/^\^^Sf',^,f-^*'^.'^,f. ' 
respectively, and are assigned the theta-grids '[ ]' and <m/WWm)> . yielding 
sentences like (120b) and (120c). 

22. '<+(ief>' and '<-def>' stand, respectively, for '<definite>' and '<indef inite>'. 

23 The Chinese and Japanese verbs corresponding to the English verbs 'be; arise' are 
'* M- Si^' and '%?.; &?>'. respectively, and are assigned the theta-grids ?E 
[Lo! Th'<«lef>], ^ [Th <^ef>, Lo] ; «4 [Hh <tdef>/ (^)Th (4>)]' «nd '^^ <Lo. 
Th>; fiS [ Th ] ', yielding sentences like (121b, c) and (122h.c). 

2A This theta-grid indicates that sentences like (123a') are directly projected fro« 
the theta-grid in our analysis, rather than indirectly derived fro. the underlying 
sentence such as ' [ e see«s [Pi John to be sick ]' by wving the constituent ^subject 
'John' into the .atrix subject position. Since we are interested K>re in the direct 
realization of surface sentences than in the 'proper" derivation of these sentences, 
this slight deviation of ours fro. the traditional analysis of raising constructions 
in the Goverr«ent-and-Binding Theory .ay be excurjible. Incidentally, verbs like 
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(123a) [25], 



(123) a It see«s [pd John is sick ]. 

a', [ih John ] S9eBS [p» e to be sick ]. 

b'. mWr 

c. [pa 



(x) There are a nuaber of English verbs which are converted froa nouns. Thus, 
denoainal verbs such as 'bottle (wine), gut (a fish), knife person)' have relevant 
nouns incorporated in their semantic interpretations, as paraphrased in 'put (wine) 
into a bottle, take out the guts of (a fish), stab (a person) with a knife ', and are 
assigned the theta-grids '[Th, Ag], [So, Ag], [Th, Ag] ', yielding sentences (a) in 
(124) through (126): 

He ] is bottling [ih the wine ]. 

;a« ffi ] jE [th ] s^a/ji}is-?«(ffi). 

:Ag 2^ ] [th I^Si ] )E((D4')ic 8g«)Tc^*. 

She ] has already gutted [so the fish ]. 

Ag gk ] QW {[so iifciMMI 1 *m [n, SM ]/ «to [Th [so MSl ] SK 1) 7. 

Ag m:it 1 {[so m<D4'i3^f> i [th ii(^t>tz^ ]/ 

Th [so 11® ] lii(>t>tz^ ]) JiL*)m^tz. 
Ag She ] knifed [Th hia ] in a rage. 

Ag i* ] { fflzanT [th it i/ m [th At i =:zl i. 
Ag ] nmo^'o zLe. [th i mvtz. 

Note that the English verbs 'bottle, gut, knife' correspond to '^{X/M)W3MiW, 
m.. . . 5S(ffi) (ffiMi/li. . . -jn ' in Chinese and ' ffi((D4>)tC>K y>%. Tr!(c04') 

y b) tj:bt) 4.^*3 g^<,21:gjB'f' in Japanese, each of which is assigned exactly the 
saae theta-grid assigned to the corresponding English verb, ^tote also that the 
incorporated nouns 'bottle, gut(s), knife' in the English predicate verbs appear 
overtly as 'jgf-, RK. 7]^: M. ii<ht)tz. J]' in the corresponding Chinese and 
Japanese predicate verbs, playing the seaantic roles of Goal, Theae and Instruaent, 
respectively. 

(xi) In addition to verbs, adjectives can also be used as predicates and assigned 
proper theta-grids so as to project into sentences. English adjectives 'afraid' and 
'fond', for exa^le. will be assigned the theta-grids '[({Th/Pd)) Ex]' and '[Th, Ex]', 
yielding sentences (127a) and (128a). Siailarly, Chinese adjectives 'ffl; Wti' and 
Japanese adjectives "nTffit,^; tftf^B]- are assigned the theta-grids '[({Th/Pd}) Ex ]; 
[{Th/Pd/Pe) Ex]' and '[(Th/Pe(0} Ex ] '. respectively, yielding sentences like (127b, c) 
and (128b, c). Cbapare: 



(124) a. 
b. 

c 

(125) a. 
b. 
a. 

(126) a. 
b. 
c. 



'(non-existential) be; Ji (NP). (Ji) (AP) ; tz' and 'becoae; (NP)/ 9^ (AP)) ; 

{(NP) \Z/ (AP) OiCCS'. which aay Uke MPs as well as APs as coapleaents, will be 
assigned the theta-grid '[At, Th]', with 'At' standing for 'Attribute' and yielding 
sentences such es 'He {is/ becaae) (a doctor/ rich); ftfe {[Ji/fiE^) <HKi/ {(S/ftWS) 

^;tf?i± m^/^n*>) {/-c/ui^c-pw. 

25. Note that the English verb 'seea' corresponds to a sentential adverb 

't^^R' in Chinese and an adjectival suffix ' hLl^' in Japanese, which are assigned 
the theta-grids '[_Pd]' and '[Pd_]', respectively, yielding sentences like (123b, c). 

26. More precisely, while "nJtflC"'' falls under the traditional category of adjective, 

falls under what we call 'adjectival noun". Moreover, both 'nTffit,">' and 'jijtfr' 
are analyzed as "transitive* adjectives which take 'Yy y^' as external argument or 
subject, and 'Xx i?' as internal argUMnt or object, as illustrated in (127b) and 
(128b). 
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(127) a. [Ex i ] a« afraid ({[Th o f our teacher ]/ [Pd that our teacher wi 11 punish 
us ])). 

b. [Ex g I mj^ ({[Th msm$M ]/ [w mnm^ikattmm jd. 



c [Ex gii ] {[Th ^-.^ti ]/ [Pd ^.^iZ Pf-.i^iaaffl 

(128) a. [Ex I 1 a« fond [Th of i Busic/slnglng songs )]. 

b. [Ex § ] {[Th SHTTPdl^^p ]/ [Pe -fepgar ]). 

c [Ex mi 1 {[Th g^llT/ [Pe M^gMlffiil ]) B¥tt 

Agentive nouns (e. g. 'teacher, student, author, editor; fP^. IS^; 

fP^. 18^^^'). deverbal action nouns (e.g. 'destructioa withdrawal, analysis, 
description;^^ t»M. ^#f. JSit; ^Mf. and locative nouns 

(e.g. 'top, botto«, front, back; Tffi, p®, Qfcffi; T. BU. S^') "ay 

optionally take ThG«e and Location, respectively, as coapleaents. Thus, these nouns 
■ay be assigned the theta-grids '[(Th)]' or J^[(Lo)] '^^ yielding such expressions as 
'the teacher [Th of Engl ish 1 ; 



ishj; [Th l^fla ] ^gf; [Th ] 'the destruction 

rrhpMfi^ JTO; [Th *^M^Am<OZt>V' and 'the top [Lo 
i ] ±ffi; [Lo 0< f i'. 



[Th of the old house 1 
of the desk ] ; [Lo S^l^ ] 

(xii) In addition to verbs, adjectives and nouns, prepositions (Including 
postpositions in Japanese), adverbial particles and conjunctions are assigned 
theta-grids which will project into prepositional (or propositional) phrases and 
subordinate clauses. Locative prepositions and postpositions like ' {at/in/on/ab6ve/ 
under/below/beside/before/after) ; . . W «(ffi)/±(ffi)/T(ffi)/5?Gi)/t3(®)/^ 

(m))); . . . {4>/0{4'/±/T/w/hu/^)){C', for exa«ple, aay be assigned the theta-grid 
'[ Lo ]'; teaporal prepositions and postpositions like ' {at/on/ in/during/ before/ 
after/ti 11}. . . ; {^. . . {(if=fm) mWtm m-. ■):■■■ {U:/4'/(W)t5/(W)ffe/l) '. the 
theta-grid [Ti] ; Instruaental prepositions and postpositions lite '{with/by/by neans 
of)...; (M/^/^/H) ■ ■ ■ : ..."e'. the theta-grid '[ In ]'; and so on, English 
adverbial particles (e. g. 'up, dowa in, out, on, off, over, away') differ fro« their 
hoaophonous prepositions in that while the latter are 'transitive' and thus mist take 
MPs as co«ple«ents, the fomer are 'intransitive prepositions' which cannot take 
coBpleaents and are assigned the theta-grid '[ ]'[27]. English adverbial particles are 
generally aatched in Chinese with locative and deictic coapleaents (e.g. '((Jb/T/JS 
/tB/5§) {3fe/^}/5S} '), and in Japanese with coapleient ste«s in compound verbs (e.g. ' 
{_h5/T'3 5/iitj?/tiif /^S) '). Subordinate conjunctions (e. g. '{when/while/before 
/af ter/t i 1 1/ i f / un less/because) ; {. . . [m^lM/m.) I @J/M/l^l^/Hm) • • ■ ) ; . • • 
{l^/M/t(f/Sfe/j^/V-(r)u fc/V-Ta i?>/V-(r)e i±7^£ltniJ/A>^/OT') [28]), on the other 
hand, differ froa prepositions and postpositions in that while the latter take MPs or 
PPs as coBpleaents. the fomer take clauses (I.e. IPs or TPs) as coBpleaents and are 
assigned theta-grids such as '[Pd, Ti] ', ' [Pd. Co]' and '[Pd. Ca]', with 'Ti', 'Co' 
and 'Ca' standing for 'teaporal clause', 'conditional clause" and "causal clause". As 
for coordinate conjunctions, they aay also be analyzed as seaantic predicates taking 
two or aore than two arguaents that are In principle identical in teras of syntactic 
category and phrasal status (i.e. word (X). seai-phrase (X') and phrase (XP)). Thus, 
we assign coordinate conjunctions the theta-grid '[Xx*]'[29], which will yield such 
OTordinate^constructions as 'John {and/, ) Bill and Dick; 'J^{=ffl/ii<l,^^^^|^^@jt 



/, l^dSPt^JlP'. 'either to go to the office or to stay at hoae; «^S_hi^#S^^; 
tQB "f ^ jy^f^tii j^S ii^' and 'not only pretty but also gentle and intelligent; ^ 

(xiii) The positions of adverbs or adverblals that occur in VPs, IPs (1. q. Ss) or CPs 
(i.e. S's) can also be indicated in the fora of theta-grid. English "non-Ji' adverbs 
(e.g. 'before, behind, afterward. Inside, outside, upstairs, downstairs, around, 
along, abroad, here, there; hard, early, late, (stay) long, (cut) short, fast, (drive) 



27. Or. alternatively, they aay be assigned the theta-grid '[ 4) ] '. Certain adverbial 
particles, howeover, aay take PPs as adjuncts, as illustrated in 'John caae on.1 froa 
behind the tree . ' 

28. For a aore detailed discussion of Japanese conjunctions, see Tang (1993). 

29. The 'Xx' represents a variable in teras of both syntactic category and phrasal 
status, whilt? the superscript star '«' stands for any nuaber that is equal to, or 
larger than, two. 
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slow, (run) deep'), for exai^le, may only appear as rightward adjuncts in VPs and are 
assigned the theta-grid '[V'_l'. while preverbal adverbs (e.g. hardly, scarcely, 
simply, merely. Just. not. never '[*}) and degree adverbs (e.g. 'deeply, badly, 
entirely (agree), fully (understand); terribly (sorry), perfectly (natural), utterly 
(wrong)') may only appear as leftward adjuncts in VPs or APs and are assigned the 
theta-grid '[_{V'/aT]'. As for '-l^ .anner adverbs (e.g. 'slowly, rap dly. 
carefully, cautiously, diligently, happily, sadly'), which say appear as either 
ririitward or leftward adjuncts in VPs. and sentential adverbs (including style adverbs 
(eg '(to speak) frankly. honestly (speaking)'). viewpoint adverbs (eg. 
'theoretically, linguistically, technically'), aodality adverbs (e. g. possibly, 
perhaps, certainly, undoubtedly') and evaluation adverbs (e. g. surprisingly, 
regrettably')), which u&y appear sentence-initially or sentence-finally, are assigned 
the theta-grids '[_V'_]' aiv: ' [_C'_] ', respectively [31]. 

5. The Projection of TheU-grids and its Constraints 

In projecting the contents of theta-grids into surface sentences, we «ist observe 
the following principles or conditions: 

(i) The Projection Principle requires that the arguaent structure and the«atic 
property of the predicate verb^ adjective and noun project to all the three levels of 
syntactic representation: D-structure, S-structure and Ugical For* (LF). In our 
approach, however, the only relevant level of syntactic representation is surface 
structure, and the theta-grids of predicate verbs, adjectives and nouns will project 
directly into surface sentences[32]. 

(ii) The Iheta Criterion requires that each argument be assigned one and only 
theta-role, and each theta-role be assigned to one and only one argument. This will 
guarantee not only that all obligatory arguaents are present in sentences, but also 
that no redundant or il legitimate elements appear in sentences. Optionality of 
certain arguK^nts, on the other hand, is specified as such by the use of parentheses 
in the theta-grid. 

(ill) The Canonical Structure Realization Principle states that each theta-role is 
■apped on to its canonical syntactic constructioa Agent, Experiencer and Benefactive. 
for exaaple. are typically realized as hunan or animate NPs; Theme, as concrete or 
abstract NPs; Quantity, as QP; Goal, Source, Instrument, Location and Time, as PPs; 
and Proposition, as various types of clauses as specified in theta-grids. Furthermore, 
the selectional restriction between the adposition and the NP in each theta-role that 
is realized as a PP is predicted or handled by lexical redundancy rules (e. g. P ~> 
to/ t^W) in unmarked cases and explicitly specified in the theta-grid in marked 



cases. 



(iv) The X-bar Convention defines the well-formedness condition on the hierarchical 
structure of, and the dominance relation aaong. constituents of phrasal constructions, 
as stated in (129); namely, all syntactic constructions are endocentric in structure, 
binary in branching, and can be recursively generated or licensed if necessary: 

(129) a. Specifier Rule: XP ~> XP, X' 

b. Adjunct Rule: X' ~> XP, X' (recursive) 
c Complement Rule: X' ~> XP, X 

Basically, (129a) sUtes that any phrasal category or maximal projection 'XP' consists 
of a "semi -phrasal' or intermediate projection 'X" and a specifier, which can be of 
any maximal projection (including null): (129b) states that the intermediate 



30. We will not discuss here whether the negative 'not' should head its own 
projection. 

31. For a much more detailed discussion of English adverbs and adverblals appearing in 
various X-bar structures, see Tang (1990b). 

32. Thus, the Projection Principle may be replaced by or subsumed under the Full 
Interpretation Principle (Fl), which requires that every element of PF and LF »ist 
receive an appropriate Interpretation or be properly licensed. 
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projection 'X" in turn consists of another intermediate projection X and an 
Sdjunct. which can be of any wxiwl projection and. furthemore can «r%than one 
in nu-ber since the rule is recursively applicable; and (129c) states that the 
intemediate projection 'X" also consists of the head word X and a co.Ple.ent XP 
[33] 'X' represents a variable that ranges over lexical categori s such as N(oun). 
V(erb) A(djective). P (reposition). Ad(vGrb) and functional categories such as 
C(o.ple«entizer). I(nf lection). D(etor.iner) and Q(uantifier)[3*]. For ej^e of 
expos Uon we will si.ply assu.e that, in the surface structure, subject NPs of 
seSeiJ^r'ap^Lar in the 's^lfier position of IPs; "^jf ^1^^, *",^„f'-Pi«'?^^ Zm^Zt 
NPs) of predicate verbs (or adjectives) occur in the co.ple.ent and the adjunct 
positions of VPs[35]. respectively; and various adverbials are placed in the adjunct 
Ssit oSs of al kinds of XP. Si.ilarly. object NPs. PPs or IPs of adpositions 
aSludiL prepositiOTTs. postpositions and subordinate conjunctions) appear as 
Ji^Ss Sf tteJe adp^Jsit'fons^ Within NPs. however co.plei«nt PP^^ ar^appositlona 
closes occur as co.ple.ents of the head noun, while relative clauses and other 
adjectivals appear as adjuncts[36l. 

M While the hierarchical structure of the constituents for.ing various XPs is 
defied by X-lSr (S^ention. the linear order of these constituents is deter-ined 
by the following conditions and paraieters: 

(130) Case Filter 

Phonological ly realized NPs «ist be assigned Case[37]. 

^'''^ ?Tnj!?sn?S\2TslSed accusative Case and oblique Case by transitive verbs 
and prepositions, respectively, fro. left to right[38] 
b Chinese NPs are assigned accusative Case and oblique Case by trans tive verbs 

(and adjectives) and prepositions, respectively, fro. left to right, 
c. Japanese NPs are assigned Case by postpositions fro. ri^t to left. 

^'''^ !SirnS?t2S*irintervene between the Case-assigner (e. g. transitive verbs 
transitive adjectives, prepositions, postpositions) and the Case-assignee (i.e. 
NPs). 



33 There are several different versions of X-bar conventioa See versions allow 
XP^adjunction in the phrase structure (i. a 'XP -> XP. XP'. as insUntiated in 

Urson's (1988) VP-shell analysis), while others allow functional categories to 
project to two-bar levels (i.e. 'XP') but prohibit lexical categories fro. doing so 
(e g. Fuki's (1986. 1988) relativized X-bar theory). 

34. Note that while lexical categories are generally provided with argument 
structures, functional categories do not see. to have argu«nt structures. 
35 The VP-internal Subject Hypothesis assuiKS that the subject NP originates in the 
specifier position of the VP in D-structure but »ves to the specifier position of the 
IP in S-structure to acquire no.inative Case. Larson (1988). on the other han<i, 
proposes that the subject NP. the object NP and the co.ple«nt PP (or NP) appear in 
the specifier position of the VP-shell, the specifier position of the VP. and the 
cott)le.ent position of the VP. respectively, in I>-structure. with subsequent .oveaent 
of the subject NP to the specifier position of the IP to receive no.inative Case and 
that of the predicate verb to the head position of the VP-shell to assign accusative 
Case to the object NP. 

36. For X-bar structure analysis of various phrasal categories in English and Chinese, 
see Tang (1989b. 1989c. 1989d, 1990b. 1990c 1990d). 

37. There are also linguists who clai. that, in addition to NPs. IPs (i.e. Ss) are 
assigned Case by co.ple.entizers and subordinate conjunctions. 

38. ie will assu.e that subject NPs in English and Chinese are assigned no.inative 
Case under co-lndexatlon with the head I of 'P. 
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(133) Arguaent-Placeaent Para»eter[39] 

a. English predicate verbs and adjectives place their internal arguaents (i.e. 
objects and coapleaents) and semantic arguaents (i.e. adjuncts or adverbials) 
on their right, and external arguaents (i.e. subjects) on their left. 

b. Chinese predicate verbs, adjectives and nouns['*0] place their internal 
arguaents on their right, and external and semantic arguaents on their left. 

C. Japanese predicate verbs and adjectives place all kinds of argument on their 
left. 

(134) Modifier-Placeaent ParaaeterL"*!] 

a. English non-single-word aodifiers of nouns (including "bare-NP' adverbs, 
prepositional phrases, infinitival phrases, participial phrases, appositlonal 
clauses, relative clauses), adjectives and adverbs (e.g. infinitival phrases 
and coaparative phrases) appear to the right of the head noun, adjective and 
adverb, while single-word aodifiers (e.g. determiners, quanti f iers[-^, 
adjectives. adverbsE'*^], participles and nouns) appear to the left. 

b. Chinese aodifiers of nouns, adjectives and adverbs all appear to the left of 
the head noun, adjective and adverb[+*]. 

c. Japanese aodifiers of nouns, adjectives and adverbs all appear to the left of 
the head noun, adjective and adverb. 

The X-bar Convention (129), coupled with the conditions and parameters listed in 
(130) through (134). will generate or license the hierarchical structure of 
constituents and the linear order aaoiig thoa. as illustrated in the English, Chinese 
and Japanese sentences below. Coapare: 

(135) a. I study linguistics in the university. 



39. This is a rather stipulative variation of the Head Paraaeter (i.e. a choice 
between "Head-Initial' and 'Head-Final) or the 6-assignient or 6-aarking 
Directionality Paraaeter (i.e. choice between 'Froa-Left-to-Right' or 'Froa-Right -to- 
Left'). 

40. Certain predicate nouns aay occur without predicate verbs in Chinese, as 
illustrated in '^XMS'. '^SIA' and 'ttH/fil^? ' • These predicate nouns are 
attributive in nature and do not seea to require Case. 

41. This is again a different version of the Head Paraaeter. 

42. Expressions like 'plenty of, lots of, a great many, a good deal of, a large number 
of are treated as lexi call zed quantifiers, which are equivalent in lexical status to 
single-word quantifiers like 'many, much. half. ' 

43. Adjectives may be modified by degree adverbs (e.g. 'very, so, too, rather') and 
quantifiers (e. g. 'ten years (old)'). Since degree adverbs (including coaiparative 
phrases) and quantifiers are mutually exclusive, they may be generalized into a single 
category. 

44. DQscriptlvo and rosultatlvo complomonts. along with a fow degree adverbs (e.g. 
'• • {ffi7/^tS/W^^7) '). however, follow the head adjective. 
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In the English sentence (135a), both the object NP 'linguistics' and the locative 
adjunct PP 'in the university' appear to the right of the predicate verb 'study', as 
defined in the Argu«ent-P laceaent Parameter (133a). Moreover, the NPs 'liguistics' and 
'the university' receive accusative and oblique Case, respectively, froe the preceding 
transitive verb 'study' and preposition 'in', while the subject NP '1' receives 
noBinative Case under co-indexation with the inflectional head I, as defined in the 
Case-Ass ignaent Parameter (131a), thus satisfying the Case Filter (i30). Next, in the 
Chinese sentence (135b) the object NP 'WiB^' appears to the right, and the locative 
adjunct PP to the left, of the predicate verb 'tH', as defined in (133b). 

Like English NPs, the Chinese NPs and receive accusative and oblique 

Case, respectively, fro« the preceding transitive verb 'jj|' and preposition '^£', 
whereas the subject NP '^' receives nominative Case under co-indexation with the 
inflectional head I, as defined in (131b), thus satisfying (130). Finally, in the 
Japanese sentence (135c) the subject NP '|^»', the object NP 'MM^' the 
locative adjunct PP '-^SfiX' all occur to the left of the predicate verb 'tfiSft'ti', 
as defined in (133c), but unlike English and Chinese NPs, all the Japanese NPs '|/,', 
-^1^' and 'mM^' receive Case froa the following topic-«arker postposition 'l±'. 
locative postposition 'T' and object-iarker postposition '$', respectively, as 
defined in (131c), thus satisfying (130). As Cases in Japanese are assigned by 
propositions and have nothing to do with predicate verbs, "scraabling' of arguments is 
freely permitted in Japanese. In English and Chinese, by contrast, the word order is 
rather rigidly fixed. CoBpare: 

(136) a He studied English diligently at the library yesterdayt^]. 
b. Yesterday he studied English dillgentlY at the library. 



Ab. Yesterday', 'B^^' and 'B^S ' are adverbs in nature and do not seen to require 
Case. 
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c. 



Yesterda y at the library he studied English diligently. 



(137) 



b. 
c. 



(138) 




a 
b. 




c. 



d. 



f. 



While English and Chinese allow only teaporal and locative adverbials to occur 
sentence-initially as theeatic adverbials, as exeaplified in (138) and (137), all the 
phrasal constructions (i.e. 'feli', B^S '. 'MW'^'. '^L^^'[^]. '1511*') in the 
Japanese sentence (138) can be freely scraabled and appear in different word orders. 

(vi) We have shown how the theta-grids of predicate verbs and adjectives can be 
projected into surface sentences without recourse to aoveaent transforaations. There 
are certain language-specific changes in word order, however, that aust be handled by 
the rule of 'Move a'. English has. for exaaple, Auxiliary Preposing, which aoves an 
auxiliary verb' occurring in the head position of VP or IP to the head position of CP 
((139a)); WH-Fronting, which aoves a wh-phrase in a direct or indirect question to the 
specifier position of CP ((139b)); Extraposition froa NP. which aoves an appositional 
or relative clause and adjoins it to the right periphery of VP ((139c)); Heavy NP 
Shift, which Boves a clausal object and adjoins it to the right periphery of VP 
((139d)); and Topi calizat ion, which aoves a topicalized phrase to the specifier 
position of CP ((139e)): 

(139) a [cp [ip John [i wUl ] do it ]]. ~> [cp [c WiU ] [ip John t do it ]]? 

b. [cp I don't know [cp John will do what ]]. ~> 
[cp I don't know [qp what [tp John will do t ]]]. 

c. [cp [w The ruaor [cp that Mary has eloped with John ]] is going about in the 

vi llage ] ]. ~> 

[cp [w The ruaor t ] is [yp [vp going about in the village ] that Mary has 
eloped with John ] ]. 

d. [cp You can [vp say [cp exactly what you think ] to hia ]]. — > 
[cp You can [yp [vp say t to hia ] exactly what you think ]]. 

e. icp [ip I aa not fond of [hp his face ]]; [cp [ip I despise [«> his 

character ]] ] — > 

[cp [m> his face ] [ip I aa not fond of t ]] ; [cp [hp his character ] 
[ip I despise t ]]. 

Chinese, on the other hand, has Object Preposing. which aoves the object NP to the 
left of a transitive verb or adjective and assigns it oblique Case with the 
preposition ((140a)) or 'ff* ((140b)), and Topical izatioa which aoves a 

topicalized phrase and adjoins it to IP ((140c)): 



(140) a [cp m IvP H [vp *^ [np ft ]]]. -> [cp a [yp S [vp [pp m 1 t ]]. 

b [cp ft^ [vp um [w m ]]]. -> [cp ^ [vp [pp 1 [yp mm t ii. 

c. [cp [ip a^SDC [hp Miffifi. ]]], [cp [ip mPfH [* MlEA 111 -> 

[cp [ip [kp mmsu. [ip ^«»: t ]]]. [cp [ip [w im&AJ. 
[ip mpf^a t ]]] 



46. The 'IZ' following the adjectival noun 'H^-WC ' aay be analyzed either as an 
adverbial suffix or as a postposition functioning like the English preposition 'in' in 
such expressions as 'In earnest' and 'in peace'. 
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As for Japanese, alnost all word-order changes can be accounted for by the rule of 
'scraabling', as discussed in (138). 



6. Conclusion: Implications for Contrastive Analysis, Language 
Typology and Machine Translation 
Since not only arguaent structure and thematic properties but also syntactic 
idiosyncracies of predicate verbs and adjectives are explicitly and econoaically 
specified in the fom of theta-grids, and are projected into surface sentences in 
quite a straightforward Banner, our a^jroach facilitates comparison between individual 
languages not only in teras of lexical entries, but also in teras of surface word 
order Corresponding 'verbs of trading' in English, Chinese and Japanese, for exaaple, 
can be coapared with regard to their theU-grids and surface realizations, as 
illustrated in (141) through (145): 

(141) a 'spend': vt. [Qc (on Th) Ag] 

tLig/NP John 1 spent [qc/hp fifty dollars ] (tth/PP on t-he book])]. 

b. '^£(»)': vt. tQc (^7 Th) Ag] 

[(WW 'i^M ] (ttvpp ST mm ]) =f6T [qc/w S+^ll. 

c. 'K9': vt. <Qc (Th {(Dtzi^)iC) Ag> 

[{Ag/fv *fiPl* 1. ([th/pp Z<D:$i(D{rzm^ 1) t<te/NP S+H ][*'^] ^-^tz I 

(142) a 'pay': vt. f<Sc (Go)> (for Th) (Ag,So)] 

tLig/icp John ] paid {[qc/mp fifty dollars] [gd/pp to Mary ]/ 
tco/ip Mary ] tQc/iP fifty dollars ]) (ttnM' for the book])]. 

b. '#': vt. t<gc (Go)> (M7Th) (Ag,So}] 

[[w* ] iTh/pp (ST^4«)] #T {t«c/NP K+pm ] (twpp i^'m ])/ 

[qo/ip ] toc/w ])]♦ 

c. 'if^y -. vt. <(Qc) (Th i(Drztb)i^) (Go) {Ag.So)> 

[hg/fp ] (tivpp Z<D:^(Dtztblc ]) ([od/hp tEf^ 1) toc/w H+R ] 

tz ]. 

(143) a 'buy': vt. [Th (So) (Qc) {Ag,So}] 

ttAg/HP John ] bought [rh/te the book ] ([so/fp froa Mary ]) 
([qc/pp for fifty dollars ])]. 

b. 'M'- vt. [Th (U Qc) (So) {Ag.Go)] 

tWNP 'hm ] ([«c/pp mi+mt d (tso/pp m-m ]) ht [ww ]i. 

c. '|R>' -- vt <Th (Qc) (So) {Ag,Go)> 

[\m/pp :mt^ ] ([qc/pp S+RT ]) (Iso/pp tef-A^^ 1) tth/PP ceo** 1 

(144) a 'sell':"vt! [<lh (Go)> (Qc) {Ag, So) ] 

[hg/KP Mary ] sold {[th/pp the book ] (too/pp to John ])/ 
Igo/hp John ] tm/itP the book ]) ([qc/pp for fifty dollars])], 
b. 'SI': vt. [Th (Go) (U Qc) {Ag.So}] 

tUl/HP d^ll ] ([qc/pp feil5:+^ ]) ITT [Th/H- ] (tco/PP ^'J^ ])]. 

c 'W%': vt. <Th (Qc) (Go) (Ag.So)> 

[[a^pp =f6^*« ] ([qc/pp S+H-e ]) ([qo/pp *jlPtC ]) [tk/pp CCO** 1 

W-^Z-c 1. 

(145) a 'cost': vt. [(So) Qc, Th] 

[[Ag/NP The book ] cost ([so/np Mary ]) [qc/np fifty dollars ]]. 

b. 't6': vt. [(So) Qc, Th] 

[[Th/»p mm ] m ([w* u [qc/np s+^ei ii. 

c. '/p*^5'-. vi. [QcThlL-**] 

[tth/PP Cco*tt J tth/NP E+H ] -h'-h'-otz 1. 

Likewise, 'ditransitive' verbs ((146) through (150)). verbs with Locative as 



47. Note that while an object NP requires the presence of '<fe' (e.g. '/"c< cSA^ii^^M 
9'), a quantifier phrase 'H+R' dispenses with it. 

48. Note that parentheses, rather than angle brackets, are used for this intransitive 
verb, indicating that no perautation in the surface word order is peraltted between 
the two arguaent s specified in the theta-grid. 
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'transposed' subject ((151)), 'unaccusative' verbs ((152)), 'ergative' verbs ((153)). 
'meteorological' verbs ((154)). 'raising' verbs ((155)) and 'control' verbs ((156) 
through (158)) can be coapared in a similar fashion, as illustrated below: 

(146) a. 'forgive': vt. [Be (Th) Ag] 

[ Please forgive [ee/w us ] ([tVw our trespasses ])]. 

b. 'WM'- vt. [Be(9^Th) Ag] 

c. '^f ' ■■ vt. <Be(OTh, Ag> 

[ rjiizt^ [pp [jte/PP UM<D ] Wnp H ] * ] ^btcipi ]. 

(147) a. 'envy': vt. [Be (Ca) Ex] 

[[sxJKP John ] envied [bb/h. Bill ] [(Vnp his good luck])]. 

b. 'mB'- vt. [Be (&^Ca) Ex] 

c. 'm.t^, %?>': vt. <Be(OCa, Ex> 

^ ^ [[fix/pp 1 [» [WH' 2>ifiP« ] [c/rp ^ill * ] ^A/-c ]. 

(14«) a. 'give': vt. [<Th, Go> {Ag,So)] 

[[Ag/HP John ] gave {[th/hp a present ] [qo/hp to Mary ]/ 
[co/HP Mary ] [tVhp a present ]) ]. 

b. '^': vt [Go, Th (Ag.So)] 

[[^HP 1 [oo/ip 'J^ ] [tv* -imUs 

c. '±lfS' : vt. <Th, Go (Ag.So)> 

^ ^ [[a«/hp ] [co/pp t6-?lC ] [ih/pp T'l/if^ ] ±l.ftc^ 1. 

(149) a. 'send': vt. [<Th, Go> {Ag.So}] 

[[Ag/HP John ] will send {[ib/vp sobg cookies ] [a,/pp to Mary ]/ [co/h. Mary ] 
[th/Hp so«e cookies ])]. 

b. '^': vt. [<Th, (J&)Go> {Ag.So}] 

[[as/hp ] {[tvhp ] [go/pp js&d-if ]/ [cD/pp/F(P (.i^)'hm ] 
[Th^o' ]}]. 

c. '^S': vt. <Th, Go {Ag, So) > 

^ ^ [[Ag/pp ;fcfiP*>' ] [co/PP t£-?u: ] [Th/PP k,-X^r>^h.g ] ^^fc]. 

(150) a. 'introduce': vt. <Th, Go (Ag,So}> 

[[ak/hp John ] introduced [rt/w Mary ] [oa/pp to Bill ]]. 

b. '^': vt. [Th.Go,Ag] 

[[a^hp ] [Th/ip I [co/pp is&d^wi ]]. 

c. mt-r^'- vt. <Th,Go,Ag> 

^ ^ [[Ag/pp :fc:ap*« ] [co/pp eJ^u: ] [th/pp ] la^tfc ]. 

(151) a. swara' : vi. <Th, Go, Ag> 

[[u/HP The garden ] is swaraing [tvpp with bees ]]; 
[[tVjcp Bees ] are swaraing [u,/pp in the garden ]]. 

b. '^m'- vt. [Th, Lo] 

[Wkp ] imr [iwip stt ]]. 

c. 'l^iji^': vi. <Lo, Th> 

[[tvpp ] [lo/pp liU: ] lt)!)«oTl>5 ]. 



49. For speakers who accept 'd^^J— ^jfil^M'J^' as well-foraed. the theta-grid 
will be '[<Th, Go> {Ag, So)]'. 

50. Japanese ditransitive verbs have a rather coaplicated systea of deixis, which can 
also be incorporated in the theU-grid (e.g. 'JilfS (vt. <Th. Go (Ag, So) <-!», 
_h{fS (vt.<Th, Go {Ag, So) <-I, +H»), <nS (vt. <Th. Go < + !> {Ag. So) <-H»), 

S (vt. <Th. Go <+!>, (Ag. So) <+H»), %o (vt. <Th. So <-I, -H>, (Ag. Go) >, t^/r/^< 
(vt. <Th, So<-I, +H> (Ag, Gc)», -^5 (vt. <Th, Go <-I, -H>, (Ag, So)», <nT-?'S (vt. 
<Th, Go <-I, -H>. (Ag. Go)», where '+!', '-!', '+H', '-H' stand for 'first-person', 
'non-first-person (i.e. second and third persons)'. ' honorific (or superior)' and 
'non-honorific (or inferior)', respectively. For a aore detailed discussion of 
Japanese deixis, see Tang (1993). 
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(152) a. 'arrive'-: v. [51] [Th (there)] 

[[th/HP A guest ] arrived yesterday] ; 
[[There] arrived [tVip a guest ] yesterday ]. 
'M, 3*S': V. [(^)Th (*)] 

[[th/NP ^-&.^A ] BiF5^ mm J ]; [ 01/3*) 7 [ih/NP -{a^A ]]. 

c '*<. ^mt^': vi. <Th> 

[ B^B [th/PP *5«^A,# ] -A mi^tzmmvtz )]. 

(153) a. 'open': v(t). [62] [Th (Ag)] 

[ [ak/np John ] opened [th/NP the door ] ] ; 
[[rh/K> The door ] opened (autoaatlcally)]. 
'itm)'- v(t). [Th (Ag)] 

[[WNP ] nm [TV.NP PI ]]; [[tvfo. p5 ] mwm) tjMj i 

c m<': v(t). <Th (Ag)> 

[[Ag/PP ] Iwpp ] m^tz ]; [[wpp PA^' ] mwmiz) ^i^tz ]. 

(154) a. 'rain': vi. [ it ] 

[[ It ] is still raining ]. 

[„p s^T ]; [ mr mu. 

c 'Pis': vi. <^> 

[[ ma 1 *m-pTi^s]. 

(155) a. 'happen': vi. [{Pd. it/Pe, Th)] 

[[ It ] happened [p^/cp that she was at hoae ]]; 
[[ih/NP She ] happened [pe/cp PRO to be at hoae ]]. 
b- '5^15': ad. [_C'] 

c 'im:- ad. [_C'] 

[cp arnir^ &*i± ]/ [ih/pp &^i± ] im i i^tcjg/-c ]. 

(156) tL 're«eBber': vt. [ {Pe, Ag/ (Pe/Pg/Pd) Ex)] 

[[ Rewaber [pb/cp PRO to turn off the light ]]; 
llsxjie I re«G«ber {[eb/cp PRO seeing hii once ]/ [gg/ip hia 
saying that ]/ [w/cp you went to school with hi« ])]. 
'iB^'(il)': vt. [{Pe {Ag/Ex)/Pd. Ex)] 

[ le^ [p«/cp PRO mm u -. 

[[bx/np m 1 mn [p^/cp PRO ^mi-A u-. 

iis^Kpm ]Bn\^^{ mmmm/ )]]. 

c. ,"Sni*tC...-rS. sSBLXl^^': vt. <{Pe. Ag/{Pe/Pd} t. Ex } > 
[ ^.tltlZ b^/cp PRO «jS*J^L^C$-l^ ]]; 
[[exM- 1 [pp [p»/cp PRO ] t ] EtlLTi^?. ]; 

[[exM- iiti ] [pp [pa/CP { ^/u^Cdt^sofc/ ^*«&i:-HK{cJ|l«S!U: 

(157) a. warn : vt. [Go. Pe, Ag] 

^li^.^ ^ 1 warned [go/np ae ] [ft,/cp PRO not to see his daughter any aore ]]. 
b. 'lf@': vt. [Go. Pe. Ag] 

[D^Np m { SF^ [(»,/«. ^]/ [co/pp ] ) [P./CP PRO ^F3gS^ftfe:ic% ]]. 

c If^TS : vt. <Go. Pet. Ag> 

^ ^ [[Ag/PP ] [go/pp ] [pp [p,/cp PRO ft^tOjftlC^^^ ] t ] ^^btz ]. 

(158) a. 'proaise': vt. [Go. Pe, Ag] 

[[a«/np She ] proaised [go/h. ae ] [pe/cp PRO to buy ae a new bicyle ]]. 

b. 'M': vt. [Go, Pe. Ag] 

] «p« [co/HP m ] [f^/cp pro •jii--mmmm, u. 

c. l^^-rS': vt. <Go. Pet, Ag> 

[co/H» li^^ ] [pp [p«/cp P(P)RO ?Sf Ll'^QKf^^H'pT ^ tlS ] 



51. 'v. ' stands for an unaccusative verb which is capable of assigning partitive Case 
to the NP that follows It. 

52. 'v(t). ' stands for an ergative verb which can be used as an "inchoative 
intransitive" verb as well as a "causative transitive" verb. 
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t ] ll^^Ltz ]. 



The above examples further illustrate how the argument structure of a predicate verb 
can be specified in the for* of a theta-grid according to how aany arguments the 
predicate verb licenses and what seaantic role (i.e. theta-role) each argu^nt 
receives The association between assigned theta-roles that represent the 
's(e«antic)-selection' property of tlie predicate verb and syntactic categories that 
these semantic roles turn into in surface sentences is to a large extent predictable 
by the "Canonical Structure Realization" or "c(ategorial)-selection tendency of 
theta-roles. Thou^ the association between assigned theta-roles and arguaent 
positions (i. e. internal arguaent, external argument or seaantic arguaent) is also to 
a certain extent predictable (e. g. in an active sentence, Agent always becoaes 
subject), we place the theta-rcle that represents the internal arguaent (i.e. object 
or coapleaent) at the left periphery of the theta-grid, and the theta-role that 
represents the external arguaent (i.e. subject), at the ri^t periphery of the 
theta-grid. We also specify the subcategorial feature of predicate verbs (e.g. vl. , 
'vt. '. 'v(t). ', 'v. ') to account for the difference in their Case-assigning capacity. 
Whatever idiosyncratic properties, lexical or syntactic, aay exist with regard to 
particular predicate verbs, they are also specified in their theta-grids. Thus, the 
lexical entries of corresponding verbs between different languages can be explicitly 
and econoaically coapared in teras of the nuaber of arguaents they license, types of 
theta-role assigned to these arguaents, syntactic categories and graaaatical relations 
these arguaents are associated with, along with such idiosyncratic syntactic features 
as inherent Case-aarking of coapleaents, aarked choice of adpositions, and 
perautabi 1 ity between coapleaents, between adjuncts or between a coapleaent and an 
adjunct. Our analysis shows that aaong the contents of the theta-grid, the arguaent 
structure, theaatic property, and selection of internal and external arguaents with 
regard to the corresponding predicate verbs between different languages are 
essentially the saae, and that what aay differ froa each other is the 
subcategorization feature (and the resultant difference in Case-assigning capacity), 
and certain idiosyncratic syntactic features aentioned above. The mapping of 
theta-grids to D-structures and/or S-structures is also quite si^le and 
straightforward. The X-bar Convention, replacing phrase structure rules, or rather 
serving as well-foraedness conditions on various syntactic constructions including 
sentences, will provide us with appropriate hierarchical structures, into which 
various arguaents specified in theta-grids plus those introduced by lexical redundancy 
rules can be inserted. We can look upon our structural tree projected froa the 
theta-grid of a predicate verb or adjective as a kind of Chri-taas tree which has an 
"endocentric' trunk and 'binary' branches, and all sorts of aibU«ents available serve 
as Christaas decorations to be hung on their proper positions in the tree according to 
the instructions given in the theta-grid. The linear order of the arguaents, on the 
other hand, is largely an issue of paraaetric settings which include paraaeters of 
Case-ass igraient and arguaent -placeaent (or theta-aarking) directionality[S3l. In 
generating or licensing surface sentences, aoreover, principles of universal grasaar 
such as the X-bar Conventioa the Projection Principle, the Full Interpretation 
Principle, the Theta Criterioa the Case Filter, the Adjacency Condition and the 
Econoay Principle (i.e. aoveaent as "the last resort") aust be strictly observed 
across languages. Thus, while language-particular distinctions are chiefly accounted 
for by fixing the values of relevant paraaetersl*^], cross- linguistic siallarities 
follow froa sharing the saae principles of universal graaaar. 

At a aore abstract and general level, language typology can also be discussed in 
teras of universal principles and paraaeters. The distinction between the so-called 



53. Note that we have placed the internal and external arguaents at the left-periphery 
and right-periphery of the theta-grid, respectively, and also indicated the 
perautabi lity of arguaents by enclosing thea in angle brackets, which provides further 
inforaation for the linearity of the arguaents involved. 

54. For other principles and conditions of universal graaaar, see Tang (1989d, 1990a, 
1991a, 1992d. 1994a). 

55. The periphery of a particular graaaar, which is responsible for aarked 
constructions in each language, also contrilwtes to language-particular 
idiosyncracies. 



ERIC 



-130- 



Ui 



'SVO language', 'SOV language' and 'VSO language', for exai^Jle. can be ascribed to the 
difference In Case-ass Ignaent and theta-«arklng directionality as well as the relative 
positioning of the Inflection 'I' and its coapleaent VP. Consider English, a typical 
SVO language, in which the coapleaent VP appears to the right of the inflection head, 
and the subject NP originates in the specifier position of the VP but raises to the 
specifier position of the imedlately doiinating IP so as to acquire noainative Case, 
while the object NP appears to the right of the transitive verb to receive accusative 
Case, yielding an SVO sentence, as Illustrated in (159): 

(159) IP 

/ \ 

[hp ] r 

A / \ 
I VP 
-ed / \ 
A NP V 
John / \ 
I V NP 
I ' study English 



Next, in Chinese, which is also an SVO language with an SOV variant, the coaplenent VP 
also appears to the right of the inflection heiw with the subject NP raising froB 
[Spec, VP] to [Spec, IP], and the object NP appearing to the right of the transitive 
verb, so as to acquire noainative and accusative Case, respectively, yielding an SVO 
sentence (160a) or. alternatively, the object NP adjoins to the periphery of the 
iMediately doainating IP, yielding an OSV sentence (160b), with subsequent adjunction 
of the subject NP to a newly created IP, yielding an SOV sentence (160c) [56] : 

(160) a. IP 
/ \ 

[kp 1 r 

A / \ 
I VP 

"7 /V 

A NP V 
I V NP 



b. IP 
/ \ 
NP IP 

^^Hii / \ 

NP I' 

I \ 

1 VP 

/ \ / \ 

V r NP v 
«^ 7 t. / \ 

V NP 
tj U 



56. The object NP introduced by the preposition 'iC' and receiving oblique Case froa 
it Bay also appear between the subject NP and the predicate verb, as illustrated in 
d'^ffi^KlSJi^T '. which also aanifests the apparent SOV order. 



-131- 



IP 

/ \ 

NP IP 

'J^. / \ 
NP 

NP 
t', 



IP 
\ 

r 

/ \ 
I 

/ \ 

V I ' 

7 



VP 

/ \ 

NP V 

t. / \ 
V NP 
tj tk 



Finally, in Japanese, which is a typical SOV language, the co«ple«ent VP a^jears to 
the left, rather than the right, of the inflectional head, and the head verb appears 
to the right, rather than the left, of its co«pIe«ent and specifier. Since all NPs, 
including the subject NP, receive Case fro* postpositions that follow then, no 
■ovenent seess to be necessary for reasons of Case-ass igment, yielding an SOV 
sentence (161a) or, alternatively, by adjoining the object NP to the IP, yielding an 
OSV sentence like (161b): 



(161) a. 



A 



/ 



IP 



\ 



I 



/ \ 
VP I 

V A 

/ \ ' 

[57] pp V ' 

/ \ / \ 
NP P PP V 

1 NP p r 



/ 

NP 



I 

PPk 

\ 



IP 



\ 



IP 
/ \ 
P PPl I' 

* / \ / \ 

NP P VP I 

:m> fs^ I \ I \ 

pp V Vj I 

ti / \ «i5§lb tz. 
pp V 
tk tj 

FUrtheraore, Choasky and Lasnik (1991:35) point out that if the verb in a tree 
structure like (162a) raises to the head position of IP and the subject NP re«ains in 
the specifier position of VP, then we have an instantiation (162b) of a VSO language 

[58]., 



57. We are by no aeans sure that the subject NP should be raised to [Spec, 
IP], since there seeis to be no 'self-serving' purpose for such aoveaent at 
S-structure. The aoveaent aay be necessary, however, at LP to check Spec-Head 
AgrGeacnt. 

58. Choasky and Lasnik (1991:35) propose that while V raises to I at S-structure, ita 
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1 /a o 



(162) 



[np 



IP 

/ \ 

r 



/ 



\ 

VP 

/ \ 

NP V 

/ 

V 

I 



\ 

NP 



IP 
/ \ 
[hp ] I' 

/ \ 
I VP 
/ \ / \ 
Vi I NP V 
/ \ 
V NP 

ti 

I 



Next, let us exaaine the cross- linguistic variation in WH-Fronting, an 
application of the rule Move a (or aore generally, Affect a). English differs froi 
Chinese and Japanese in that it aoves the wh-phrase or question-phrase to the 
periphery of the proposition (Bore precisely, to [Spec, CP] in the case of English) at 
S-structure, while the latter do so only at LF ^ (perhaps, by adjunction to IP) to 
indicate the scope of the question-phrase. In the case of a aultiple wh-questioa in 
vfhich acre than one question-phrase occurs, English allows only one question-phrase to 
■ove by S-structure while the others reaain 'in-situ' at S-structure and aove to 
clause-peripheral position at LF. Thus, languages aay differ in (i) whether they allow 
overt or visible aoveaent of question-phrases at S-structure. and (ii) if they do, how 
aany question-phrases aay aove and where they aova In an English-type language, only 
one question-phrase aoves to [Spec, CP] at S-structure; in a Polish-type language aore 
than one question-phrase can be aoved at S-structure, one aoving to [Spec, CP] and the 
rest adjoining to clause-peripheral position and in a Chinese/Japanese-type 

language, all question-phrases reaain in-situ at S-structure. All languages, however, 
aove the question-phrases in-situ to clause-peripheral position at LF ^1] to indicate 
their scope. Interestingly enough, this typological distinction between overt (English 
and Geraan) and covert (Chinese and Japanese) aoveaents of question-phrases at 
S-structure seeas to be closely related to another typological difference in the 
position and function of the head (i.e. the coi^leaentizer C) of CP, which in turn 
seeas to be related to the typological distinction between the presence and absence of 
final particles as well as Auxiliary Preposing. In English and Geraaa C appears to 
the left of its IP coapleaent and aay be occupied by a coapleaentizer (a g. 'that, 
whether, for' in English) or serve as a landing site for an auxiliary to aove in; in 
Chinese and Japanese, on the other hand, C seeas to appear to the right of its IP 
coapleaent and aay be occupied by final particles (including the relative clause or 
aodification aarker (Sfy), as illustrated in (163), (164) and (165): 



subject raises to [Spec. IP] only at LF. 

59. In fact. Chinese and Japanese do allow optional aoveaent of question-phrases at 
S-structure, as illustrated in ' {fl/fcMAJ ^'P^JIfefitR? ' and ' {|i^:^l±fi« L T I > * 

60. Note that across-the-board aoveaent of a coordinate construction containing aore 
than one question-phrase is possible with English, as illustrated in ' When, where and 
how did John study?'. 

61. See Huang (1982) and auch subsequent work. 



ERIC 



-133- 



144 



(163) a. [cp [c -WH] [ip I know [cp [c Ihat 1 [ip John will cone ]]]]. 

b. [cp [c -WHl [ip I don't know [cp [c whether ] [ip John will coae IIJ). 

c. [cp [c -WH] [ip This IS [np the book [cp Op \P] Ic that ] [ip 1 bought t 

yesterday ]]]]]! 

d. [cp [c lUi 1 [ip John [i t ] come ]]? 

(164) a [cp [ip [cp [ip 1 [c -WH ]] (c 65l HI. 

b. [cp [ip [cp [ip 'J^#^^5J$ 1 [c +WH )]]]]. 

c. [cy [ip m [hp Icp Op [ip 1 ] [c IS 1 1 « 1 [c ffi 111' 

(165) a. [cp [pp [hp [cp [ip (.m) ;«^:6>*2> ] [c -*H 1 ] ^ i: ] l± 1 ^-^Tl^* 1 

b. t [ip im) ^t^^^t^ {j^^CiVif 9) A> ] [c +WH 1 li: 1 ^i^rii^ 1 
[c i£. ] ] 

c. [cp [ip CinU: [hp [cp Op [ip I/.*? rpB t Ho^c 1 [c ]] * 1 1 [c i ]]. 
4 [cp [ip >kg?l±**-r ] [c *i ]]? 

The coBple«entizer position is specified with the feature '-WH' or '+WH'. or si.ply 
left unspecified. In an English-type language, in which C precedes IP. the L 
containing the feature '-WH' will be filled in by the declarative coapleaentizer 
'that' (as in (163a)) or 'for' [63], depending on whether the following IP is a finite 
or non-finite (i.e. infinitival) clause: the C containing the feature +WH , on the 
other hand, will be filled in by the interrogative complementizer whether (as in 
(163b)) or 'if or when left empty, serves as a landing site for an tmxiliary to 

move in (as in (163d)); and the C without the feature specification -WH or +WH 
and dominated by both CP and NP, will be filled in by the complementizer that which 
introduces a relative clause (as in (163d)) or an appositional clause (^1 In a 
Chinese/Japanese-type language, in which C follows rather than precedes IP, the C will 
be filled in by various types of final particles that indicate the "ood or 
illocutionary force of the speaker. Thus, the C containing the feature -WH is filled 
in by declarative particles such as 'IS, ^. Ifg: ^. % (male-spGaker-oriented) 
(female-speaker-oriented), i (speaker-oriented), (addressee-oriented) . the C 

containing the feature '+WH', by interrogative particles such as }yfs-no 
question), Ifg (choice-type question and wh-question) . *> ; and the C containing 
neither '-WH' nor '+WH', by other particles such as '1^ (conjectural or optative). ^ 
(conjectural or optative), (certainty)' if it occurs in a root sentence, and by the 
subordinate marker '(fy: ^' C^l if it is dominated by both CP and NP. 

In English-type languages, the presence of the feature +WH under C not only 
indicates the semantic type of the following IP (i.e. question or Pq) and selects the 
proper complementizer in concert with the tense feature specification under 
I (nf lection) (or under T(ense), if there is an independent motivation for the 
existence of TP), hut also serves as a kind of 'force indicator which attracts the 
question-phrase to [Spec. CP]. Moreover, when the C containing the feature +WH 



62. 'Op' stands for the null operator or empty relative pronoua the movement of which 
leaves a trace 't' behind in the relative clause. 

63. As in 'I want very much for John to come. ' 

64. The interrogative complementizer 'whether' may introduce either finite or 
non-finite clauses (e.g. '1 don't know whether {I should go/to go) (or not)'), while 
'if may only introduce finite clauses (e.g. '1 don't know if {I should Ko/«to_£o) (or 
not)'). 

55. As in 'the rumor that Itery has eloped with Joha ' 

66. While the Chinese introduces relative and appositional clauses and functions 
as a subordinate or modification marker, the Japanese '(D' serves as a nominalizer. as 
in 'mi [pp [cp [ip *t*^^^T< 1 [c ^ ]) AM «|fU:n£o/-c'. Japanese clauses can also 
be nominalized by taking so-called "formal nouns' (e.g. 'Zt, i>(0. tZ.^, ), 
as in 'm [pp Icp [IP ] [c 11 AM tttCfio^C As for Japanese 

relative clauses, no relative or subordinate marker is needed. 
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occurs in a root sentence, it will also serve as a landing site for a aodal or 
aspectual auxiliary to move in, yielding a direct question. In Chinese/Japanese-type 
languages.' the feature specification under C also indicates the semantic type of the 
preceding IP and licenses the insertion of proper fina". particles which are in accord 
with the semantic type of the preceding IP. In the case of the feature '+WH', it will 
trigger a 'V-not-V" question if no question-phrase ocirs m the IP and, moreover, the 
I is also specified as '+WH', as illustrated in Chinese sentence (164b) [^7] and 
the Japanese sentence (155b). As the specifier and ti 9 head positions of CP in Chinese 
and Japanese are separated by the intervening IP, the sentence-final C containing the 
feature '+WH' neither attracts a question-phrase to [Spec CP] nor serves as a landing 
site for a modal or aspectual verb to move in from the constituent I posit ioa which 
seeas to account for the non-occurrence of f roa-ri ght-to-lef t WH-Fronting and 
Auxiliary Preposing m Chinese and Japanese surface sentences l*^). The raising of a 
final particle froa the constituent C position to the matrix C position, however, can 
happen when an interrogative particle which originates in the constituent sentence 
raises to the matrix sentence to receive "wide-scope' interpretation. Thus, Chinese 
verbs such as '^M. tj.^, may take questions or interrogative clauses as their 
complements only when these questions receive wide-scope interpretation; that is, the 
entire sentence is interpreted as a question. In this case, interrogative particles 
(e.g. % fi) originate in the constituent C position (and even trigger V-not-V 
question, as illustrated in (166b)) but raise to the matrix C position, as illustrated 
below: 

(166) a. [cp [ip ft^^ [cp [ip 1 [c ?i 111 [c +WH ]]■■' 

~> [cp [ip m% [cp [ip ] [c t ]]] [c S 11? 

b. [cp [ip (''FIU^ [cp [ip 

--> [cP [ip [cP [IP 1 [c 1 111 Ic ffi 11? 

c. [cp [ip m [cp [ip ii#5tS ] fc E 111 [c +WH 

~> [cP [ip m [cp [ip ti#5t5 1 [c t ]]] [c E jl? 

Languages can also be compared in terms of the Head-Initial versus Head-Final 
Parameter (or, alternatively, the Theta-Marking or Theta-Assignment Parameter, the 
Case-Assignment Parameter and/or the Argument-Placement Parameter [69]), which accounts 
for, among other things, the "mirror-image" phenomenon in linear order of constituents 
between English and Chinese. 

(167) a. John [vp [[[[ studied English ] diligently ] at the library ] yesterday ]]. 

b. [vp [ [ ^im t m iiii. 

The contrast between (167a) and (167b) clearly shows that, with regard to the head 
verb and its adverbial modifiers, English is head-initial and left-branching w'.il le 
Chinese is head-final and right-branching. If the linear order of modifiers is, in 
general, determined by semantic proximity between the head and its modifiers (i.e. 
modifiers which are sesantically closer to the head appear locationally nearer to it 
as well), then the reverse order of adverbial modifiers manifested in the head-initial 
English and the head-final Chinese is a natural consequence ['^^l. A corollary of this 
is, when these different types of adverbials appear together in the form of a 
question-phrase in a coordinate construction and move to the sentence-initial position 
as a result of WH-Fronling. the linear order of the adverbials will be the same for 
English and Chinese, since in this case the quest lon-adverbia Is precede the head 



67. For detailed discussions of Chinese "V-not-V" questions, see Huang (1991) and Guo 
(1992). 

68. For a somewhat different proposal to account for the occurrence and non-occurrence 
of overt WH-Fronting at S-structure, see Cheng (1991). 

69. We will not discuss here which parameter is the least stipulative or whether 
parameters can be unified into one. 

70. The fact that the object NP follows the head verb in both languages is accounted 
for by the Case-Assi gnment Parameter, according to which the transitive vero assigns 
accusative Case to its object NP from left to right in both languages. 
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verb [^^\ in both languages, tanifesting a surface "head-final" configuration. Compare: 
(168) a. Whea where and how did John study English? 

Finally, we would like to mention briefly how our "ainiaalist" approach might 
have relevance to the theory of machine translatioa 

By comparing not only the contents of the theta-grids for the corresponding verbs 
of English, Chinese and Japanese but also the way they project into surface sentences, 
we think our theta-grids, along with the few principles • and parameters discussed 
above, provide vital information for language parsing in a very simple format. Among 
other things, the number of obligatory arguments, the distinction between internal 
(i.e. object), external (i.e. subject) and semantic (i.e. adjunct) arguments, the 
optionality and permutabi lity of tirguments at S-structure, the syntactic categories 
that these arguments are turned into (via canonical structure realization), the 
selection of marked and unmarked adpositions (including prepositions, postpositions 
and subordinate conjunctions) for the arguments, and partial linearity among the 
arguments, are either listed in theta-grids or handled by lexical redundancy rules. 
The mapping of theta-grids to surface sentences is also quite simple and 
straightforw£ird, since there are only a few principles and constraints to observe and 
a few parameters to choose from. 

The X-bar Convention, simple in form and content, seems to obviate the necessity 
for phrase structure rules which essentially repeat the information already provided 
in the theta-grids. If endocentricity of phrasal constructions and binary branching of 
their constituents are strictly observed, the majority of improper structural 
descriptions will be eliminated, thus considerably reducing tlie burden of sentence 
parsing. Furthermore, transfer rules can also be reduced, to the minimum or even 
entirely eliminated. Suppose that the lexicon (or data base) consists of lexical items 
which contain a theta-grid as part of their lexical entries and, furthermore, that the 
lexical entries for the corresponding verbs, adjectives, nouns, adverbs, adpositions, 
etc. between the input and output languages are listed side by side in the lexicon. 
Instead of writing separate sets of phrase structure rules for each language and two 
different sets of transfer rules for each pair of languages [72 )_ we simply look up the 
lexical items appearing in the input text in the lexicon, match them with the 
corresponding lexical items in the output language and, using the information provided 
by the theta-grids for these lexical items (especially, the predicate verbs and 
adjectives), translate the input text into the output language, by the parallel 
analysis of the input text in terms of theta-grids and mapping of the corresponding 
theta-grids in the output language to surface sentences. In this sense, our approach 
is "lexicon-driven" and "multi -directional". It is admitted, however, that there are 
still some problems to be solved (e.g. exactly how many theta-roles are necessary for 
natural languages, and how should we identify them?) and thorny technical details to 
be worked out (e.g. how should the contents and functions of theta-grids be stated in 
a computer language so that they can be readily understood by the computer?) ['^3] 



71. If these question-adverbials are analyzed as sentential adverbials, then they also 
precede the head IP (i.e. S). 

72. This means that, given n languages, there need to be n sets of phrase structure 
rules and n(n-l) (=n -n) sets of transfer rules. 

73. For a more detailed discussion of our approach to machine translatioa see Tang 
(1992d), which deals with more specific issues such as the "garden path" phpnomenon in 
language processing and the disambiguation of syntactic ambiguity 
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TYPES OF TONE SANDHI IN MANDARIN DIALECTS AND A 
FORMAL MODEL OF TONE 



Mei-chih L. Chang 
Nstionai Chung-Hsing University at Taipei 

0. Introduction. Like other phonological processes, tone sandhi in general may be 
viewed as natural processes which are to a large extent phoneticaUy motivated. 1 During 
the past few decades great strides have been made in our understanding of the working of 
tone sandhi in individual languages as more detailed documentation has become available. 
Howe\ er, some broader, fundamental questions such as the nature of tone sandhi, the 
properties of tone sandhi rules, and the relation of tone sandhi to other prosodic units 
have not been fully answered. With regard to the Chinese languages, our understanding 
of these broader issues remains poor even within the better-stodied languages mostly due 
to the complexities of diaiect variation and to incomplete dociunentation. Despite the 
difficulty of obtaining sufficient data, initial attempts to probe into the broad issues 
regarding tone sandhi can be made based on the data accessible so far. This paper thus 
intends to serve as a preliminary sUidy of the nature of tone sandhi as it looks into the 
types of tone sandhi found in a number of Mandarin dialects documented and 
representative of different Mandarin-speaking regions. In this preliminary investigation, I 
single out two widely discussed and highly controversial dialects, Changzhi and Pingyao, 
for lengthy expositions, and offer alternative analyses of their peculiar tone sandhi 
behavior. In concluding the paper, I disucss the implications of these tone sandhi 
properties for constnicting a general theory of tone. 

1. Tone Sandhi Rules in Mandarin Dialects 

1.1. Dialects surveyed and their tone systems. Among the major Chinese 
languages. Mandarin has the most speakers and occupies a vast territory (Norman 1988). 
The'ianguage (i.e. entiie dialect group) is further divided into subgroups on the basis of 
geographical locations. 2 For ttiis study, representative dialects from each major region 
for which sufficient documentation is available are used. Before we discuss the tone 
sandhi, the tone systems of the dialects stodied are first listed in ( 1) below. 
( 1 ) Tone systems of Mandarin dialects 

Tonal categor y 

Beijing(BF) 

Tianjin(BF) 

X uzhou(BF) 

Xian (XB) 

Pmgyao(XB) 

Chan gzhi(XB) 



Chengdu(XN) 
Wuhan(XN> 




Zhenjiang(JH) 42 35 31 

Yanj>zhou(JH) 31 34 42 

Hefei (JH) 212 ?5 24 



55 
55 
53 



5 
4 
4 



5 
4 
4 
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In tenns of tone sandhi in these dialects, two general obsen'ations are worttiy of 
note: first, tone sandhi does not seem to be conditioned by the tonal categories which 
correspond historicaUy with the Middle Chinese categories; instead, tone sandhi is clearly 
related to the tone values in the modem dialects and is perhaps triggered by other 
prosodic factors such as strest, and the prosodic template in the dialect Second, tone 
systems with almost identical tonal inventories (e.g.. Zhenjiang and Yangzhou) may not 
have identical sandhi processes; however, dialects witliin the same group tend to share 
certain sandhi processes. These points are made clear in subsequent discussions. 

1.2. Tone sandhi in the northern (BF) dialects. The best known tone sandhi in 
Mandarin is that of Beijing in which a third tone changes into a high rising value (i.e. T2) 
when followed by other T3 syllables.^ In the BF subgroup, the northeastern region 
(Manchuria) is most similar to the Beijing dialect; e.g., Shenyang (as documented in 
Yuan 1960) has the same tone system and the T3 sandhi as that of Beijing except for the 
lower register on the first tone ([33]). In contrast the dialect spoken in Tianjin, about just 
120 kilometers southeast of Beijing, has a set of tone sandhi quite distinct from that of 
Beijing, despite the obvious similarity in tone values in citational tone categories. Tianjin 
tone sandhi occurs pervasively in the preceding syUable of a disyUabic string, the rule of 
which is iiiustratefJ below 

(2) Tianjin tone sandhi (Li and Liu 1985):<5 

a. 2i->213/ 21 i.e. p'-> h/ 1 _ 1 

b. 213 -> 45 / 213 i.e. L --> W L 

c. 53— >21/ 53 i.e. H.h 1 --> L/ H.h 1 

d 53— >45/ 21 i.e. hl-->lh/ L.1 

Thiese sandhi processes are clearly dissimilatory in nature; i.e., both (2b) and (2c) 
involve register diijsimimtion, while (2a) and (2d) involve the dissimUation of pitch 
specifications. With regard to the complex contour tone [213] and rule (2a), some 
clarification is needed; that is, in Ught of the rare occurrence of complex contours 
preceding odier syllables, (2a) seems odd for it changes a low faUing tone to a complex 
contour (i.e., low-falling-rising) when followed by another low falling tone. If the 
complex contour tone documented in Tianjin is similar to that of T3 in BM, the duration 
of the syUable bearing this tone should be much longer than that of the syllables bearing 
other tones. However, according to the phonetic studies by Shi (1990), the duration of 
the documented complex contour tone in Tianjin, whether it be the sandhi tone in (2a) or 
the citation tone in (2b), is not any longer than that of other tones. Based on this finding 
and the appearance of a clear rising contour in the phonetic studies, Shi treats this tone as 
simply a low rising tone [13], of which analysis is adopted here. 

Another relatively well-documented BF dialect is Xuzhou, spoken in noithem 
Jiangsu close to the border with Henan and Shangdong provinces. Like Beijing, major 
sandhi in Xuzliou is concerned with the complex contour tone [213]. 

(3) Xuzhou tone sandhi: 

213— > 13/ 213 

213 — > 22 (or 21) / X (X: all other tones) 

(3) is reminiscent of the T3 sandhi in Beijing: the complex contour Is never fully 
realized unless prepausally. The difference between the Xuzhou sandhi and the Beijing 
T3 sandhi is mainly in the register dissimUation which Beijing undergoes, but Xuzhou 
does not In addition to (3), a lexical sandhi process in Xuzhou merits some discussion. 
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In reduplicated forms, there is a strong tendency for the second reduplicated syllable to 
surface as a high level tone [55] J In addition, there is triplication m the Xuzhou lexicon 
which gives the following patterns: 

(4) Triplication in Xuzhou: 



a. [213]: [22- 55-213] 

b. [55]: [55- 55 - 55] 

c. [35]: [35- 55- 213] 

d. [42]: [42 - 55 - 213 or 42] 



With the exception of (4b), one may consider the second and third syllables in 
triplicated forms to follow a template with the first syllable maintaining its original tone.^ 
It is most likely that the high level tone in the middle is motivated by weak stress; 
evidence of this comes from a similar reduplication process involving three syllables (i.e., 
the BBA pattern), in which the second syllable can be either a [55] or neutral-toned (0). 

(5) TripUcation and BBA reduplicates mvolving [213] (Li 1985:36-38):^ 

a. [tian213]: [tian22-tian55-tian213] 'everyday' 

b. [ban213-ban213-gao213]: (i) [ban22-ban55-gao213] 'that tall; the same 

(ii) [ban22-ban0-gao213] height' 
1.3. Tone sandhi in the southwestern (XN) dialects. Tone sandhi in the XN 
dialects tend to be limited. The following are the tonal alternations documented for 
Chengdu (in Sichuan), and Wuhan (in Hubei) repectively (Beijing Daxue 1964, and 
forthcoming):^^ 

(6) Chengdu lexical sandhi: 

a. 31 —> 44/ 31 i.e. L— >H/L 

13 —> 44/13 i)f ix xy 

b. 53 — > 31/53 (in some reduplicates) i.e. Jfi — > L / ^ 

( — > 44 (in a few isolated cases)) hi hi 

(7) Wuhan tone sandhi: 

213 — > 13/ 213 

213— >21 / X 

(7) is the same as (3), once again indicating the preference of partial realization of 
the complex contour tone. The Chengdu lexical sandhi (6a) and the major pattern of (6b) 
are both dissimilatory; i.e., in a sequence of identical tones, low register contour tones 
become high register level tones, whereas a high falling tone becomes a low falling tone. 
Furthermore, note that morphemes which undergo these changes tend to be functional 
morphemes (e.g. suffixes) bearing little stress. Comparing Chengdu with nearby 
Kumning (in Yunnan) which has exactly the same tonal inventory as Chengdu, we find 
some differences in sandhi environments, but virtually the same processes (i.e., tonal 
change on weakly stressed syllables such as the second syllable in a reduplicated form 
and a suffixed form). 

(8) Kunming lexical sandhi (based on the data from Beijing Daxue 1964): 

53 — >44/53 

(in limited cases) i.e. — >H / X 

31 — >44//3l) hi h xy 
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1.4. Tone sandhi In the eastern (JH) dialects. Two JH dialects are examined 
here. Hefei, spoken in Anhui, has the following lexical sandhi, based on the data from 
the revised version of the Lexical Survey (due to Wang Hongjun, p.c). 

(9) Hefei lexical sandhi: 

a. 212— > 12/ 212 



I.e. 



212 — > 21 / 



L 
/ l\ 
hih 



L 
/ l\ 
hih 



21 — > 12/ 



53 



24 — > 33 / 



4 — > 21 / 




k 
hi 

H 

/ 

1 h 



1 



— > 



Ih 



/ 



H 

Ti 



--> 



L 
I 



/ 



(X: all but the same tone; Y: ali but the falling tone; Z: all but a L tone or a falling tone) 
(9a) and (9b) aie not different from the partial association of the complex contour 
tone which appears widely in other dialects examined so far. (9c) is crucially ordered 
after (9b) and can be considered as a contour dissimilation rule which metathesizes the 
tonemes when followed by a falling tone. The dialect's preference for alternating pitch 
contour may t>e indicated by the more complicated rule (9d) in which the rising tone [24] 
undergoes contour simplification (i.e., becoming a mid tone) when followed by all but the 
falling tone [53] and rule (9e) which alters the high checked tone to low when followed 
by all high tones except the high failing tone. 

Another IH dialect, Zhenjiang, spoken in Jiangsu, exhibits the following sandhi 
alternations: 

(10) Ziienjiang lexical sandhi (based on data from Zhang 1985): 



42 

31 

42 
31 
35 



— > 



-> 33/ 




I.e. 



/ \ 
h 1 



->l\/_ L\ 



X y--> h / X 

(X: all but the same nonfalling high tones) 
(10a) is clearly a contour metathesis rule which dissimilates the first falling 
contour when followed by another falling tone; moreover, when two low falling tones are 
in a sequence, there is also register dissmilation as well as contour metathesis. In (10b) 
falling tones undergo contour simplification when followed by nonfalling high tones. 
High rising tone is simplified to level when followed by (high) level tones in (10c). 

1.5. Tone sandhi in the northwestern (XB) dialects. Among all the Mandarin 
dialects of which documentation is available, the XB dialects exliibit the most sandhi 
complexity. Pmgyao and Changzhi, two widely discussed dialects spoken in Shanxi are 
to be closely exammed in Section 2. As we shall see, the tone sandhi in these two 
dialects involves register spread (i.e. assimilation) as well as dissimilation, contour 
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53 



metathesis and a number of tlie prosodic conditioas in the case of Changzhi. Despite the 
preliminary nature ot the investigation, these properties are important in understanding 
the extent and working of tonal alternations in Mandarin. Before we get into the detailed 
discussion of Pingyao and Changzhi. we look at Xian. a XB dialect with relativel\ simple 
tone system and tone sandhi. Based on the Lexical Siurey (Beijing Daxue 1964), Xian 
has two sandhi alternations witliin the lexicon as in ( 11 ). 

(11) Xian lexical sandhi: 

a. 21— >24/ 21 i.e. L ^ 

b. 53— > 21/53 H H 

/ s / \ 

h 1 — > L / h 1 

Both rules are dissimilatorj'. (11a) mvolves both register and contour 
dissimilation, whereas (lib) involves only register dissimilation. Uhe sketchy 
documentation prevents us from knowing whether these two alternations apply generally 
without exception and whether they cover all the tf)nal alternation cases in Xian. 

Before a simunary can be made of the types of tone sandhi found in this smdy. let 
us now turn to the discussion of the tone sandlii in Pingyao and Changzhi. 

2. Tone Sandhi in Pingyao and Changzhi Revisited 

2.1. Pingyao. There are five citation tones in Pingyao which have been listed in 
(1): [13], [53], [35], [23] and [54]; but since the two checked tones [22], \5A\ have 
exactly the same patterns of sandhi alternation as those of [13] and [53], the two sets can 
be treated with the same underlying representations. In disyllabic strings, the first 
syllable tends to undergo sandhi alternations, giving rise to patterns in ( 12). 

(12) Oi \ 13 35 53 
13 *" 13-13 J/-35 35-423 
35 13-13 J/-35 35-423 
53 53-13 53-35 55-423 

Bao (1990) first brings two cases of register assimilation in Pmgyao to the 
spotlight in support of the process of register spread predicted by his tone model. One 
case is the [13-13] pattern surfaced when [35] is followed by [13]: the other is the [35- 
423] pattern resulting fiom [13] followed by [53] (the underlined forms in (12)). In his 
analysis, a metathesis rule is needed to account for the italicized patterns in (12), and a 
phonetic detail rule accounts for the surface pitch change from [53] to [423] in the third 
colimin. Furthermore, when two [35] syllables are in a sequence, the siuface pattern of 
[31-35] seems to require a register dissimilation rule in addition to contour metathesis. 
(13) summarizes the four rules proposed by Bao to account for Pingyao lexical sandhi: 

(13) Bao's rules (1990:91-95):" 

(a) Register lowering: 

H -> L / T e.g. [35]->[13]/__[35] 

1 h 1 h 

(b) Contour metathesis: H e.g. [13]->[31]/ [35] 

X y — > y X / X ^y [53] -> [35] / [53] 

(c) Register assimilation: 

R — > aR / . «R e.g.. [13] -> [35] / _^ [53] 

1 ii [35]->[13]/__[13] 
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(d) Contour formation: (phrase- final detail mle) 

h 1 e.g. [53] --> [423] / # 

The register assimilation proposed for Pingyao has generated some debate. In 
defense of her 1989 model which does not allow register spread. Yip (1992) argues that 
the register assimilation observed by Buo (as in rule (13c)) is subject to an alternative 
analysis proposed by Chen (1991) and thus cannot be taken as a case of register spread. 
Chen's argument against Bao's analysis of Pingyao tone sandhi is based on simplicity 
considerations; i.e., he claims that the two rules (13a) and (13c) can be collapsed based 
on the observation stated in ( 14) below: 

(14) Register neutralization: For a rising tone, if the followmg syllable 
begins with h (or 1), replace the Register value with H (or L).l2 
Since this rule does not require register spread, Yip ( 1992) argues that Bao's claim 
of register spread is not valid. 

On this. I object to Yip' argument for two reasons. First, ingenious and simple as 
it is. Chen's register neutralization rule poses serious theoretical problems. What is 
shorthanded as H/L and h/1 in his statement has to be represented by two distinct tone 
features at distinct tiers in current theoretical models, and feature spreading across tiei-s is 
generally prohibited. U seems hardly justifiable to sacrifice theoretical rigor here just for 
the sake of simplicity. Second and more importantly, there is other evidence in Pingyao 
which does not support Chen's register neutralization analysis, but one with register 
spread, as we are to see now. In Hou's 1980 documentation of Pingyao, tone sandhi 
patterns are given for three types of disyllabic compounds classified on the basis of the 
granmiatical relationship between the two syllables. Patterns listed m (12), those smdied 
by Bao, demonstrate only what Hou termed Type A compounds.'-^ ITie diverse Type B 
compounds exhibit tone patterns substantially different from those of Type A. Some 
examples of Type B compounds are given in (15) followed by a summary of the tone 
sandhi patterns m (16). 

(15) Type B disyllabic compounds in Pingyao: 

(a) [u 13 Tjlack*- ta 13 'crow'] — > [31-35] crow, raven' 

(b) [t6i 13 'hunger' - xuai g 13 'barren'] — > [31-35] 'famine' 

(c) [t6ia 13 'home' - t6y 35 'utensil'] —> [13-13] 'fiimiture' 

(d) [tiE 13 'shake' - tO 53 'upside down'] — > [31-53] 'reverse' 

(e) [tei 35 'big' - mEng 13 'door'] — > [35-53] 'front gate' 

(f) [ts'ung 13 'viUage' - Z'^? 13 dimin. sufx.] -> [31-35] '(small) village' 

(g) [xuE 35 'aUey'- xuE 35] — > [35-53] 'alley' 

(h) [uang 13 'curvy' - uang 13] — > [31-35] 'curvy, bent' 

(16) Type B tonal alternations: 

Gi I <52 13 35 53 

13 31-35 or 13-13 or 31-53 or 

13-13 31-35 35-423 

35 35-53 35-53 35-53 

53 53-13 53-35 53-53 or 

25-53 
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The bold-faced patterns in (16) are the common patterns for Type B compounds 
(i.e., the second row starting with a [13] syllable and the second pattern for the [53-53] 
sequence which occasionally occurs when the first syllable is a checked [52] tone are 
minor patterns). Note that these minor patterns are exactly the same patterns for the 
corresponding sequences of the Type A compounds (cf. (12)).''* One may speculate 
their appearance in Type B to be due either to an incomplete process of lexical difftision 
or to dual-pattern sandhi alternations in certain types of compounds. For the purpose of 
our discussion, only the more common patterns for the Type B compounds are taken into 
considCTation. 

It is clear that metathesis also plays an important role in the sandhi patterns here. 
Metathesis applies to [13] when it is followed by [13] and [53] and when preceded by 
[35], and applies also to [35] when preceded by a [35]. These metathesis processes can 
be captured by the following rule: 

(17) Metathesis for rising tones in Type B compounds: A rising tone (I h) 
becomes falling (Ti I) when the preceding tone is a high rising tone (H/1 h) 
or when the following tone starts at either the bottom (LA) or the top (H/h) 
of the entire pitch range. 

f H 

I.e. I h — > h 1 / 1 L L / H 

I I I 
1 / h 



Comparing this metathesis rule with that of Type A (as formulated by Bao in 
( I3b)), we find that this rule is less general than (13b) and the conditions under which the 
rule appUes are more cimibersome. However, it is possible to conclude that in Pingyao, a 
metathesis process appUes generally to the rising tones, but the environments in which the 
rule applies differ somewhat according to different types of compounding relationships. 
The second sandhi process observed in Type B is register assimilation, which accounts 
for the register lowering of [35] when it is preceded by [13], and the register raising of 
[13] (which after metathesis becomes [31]) when it is preceded by [35]. This rule is 
stated as follows: 

(18) Register assimilation: The register value of the second syllable is 
assimilated to that of a preceding rising tone in the Type B disyllabic 



compounds. i.e. R — > a R / a ^ 



This rule is again reminiscent of the register assimilation rule (13c) given by Bao 
for the Type A compounds. The main difference lies only in the direction of the spread. 

Before we go on, let us return to the crucial debate regarding register spread in 
Pingyao. In the Type B compounds, the register raising and lowering occur in the second 
syllable instead of the first syllable of the compound as in the case of Type A. Assuming 
that both types of compounds share similar sandhi processes as we have discovered, we 
then would expect a register neutralization process along the line of (14) in Type B as 
well. However, no evidence can be successfully put forth for any such register 
neutralization process here; that is, the process of register assimilation (i.e. spread) does 
not seem dispensable in the account of Type B tonal alternations. Furthermore, as 
mentioned earlier, even if the register neutralization rule may conveniently account for the 
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register assimilation/dissimilation in the Type A compounds, the rule itself is theoretically 
problematic. Therefore, it is best to conclude thai no register neutralization rule exists in 
Pingyao; instead, a rule of register assimilation is present in the Pingyao phonology', 
which applies to both Type A and Type B compounds under somewhat different 
conditions. 

In addition to metathesis and register spread, a register dissimilation rule is needed 
to account for the register-raising of [13] when it is preceded by a metathesized [13). 
This rule is formulated as ( 19) below: 

(19) Register- dissmilation (L- raising) in Type B: (crucially ordered 
before (18)) The register of the second syllable is dissimtlated to high (H) 
when the preceding syllable has a L register, i.e. L — > H / ' 

Again, this rule is also reminiscent of the register lowering rule (13a) fonnulated 
by Bao in which the tlrst syllable of a [35-35] sequence undergoes a dissimilatory 
process, lowering its register. In comparison, (19) applies to the second syllable of a [13- 
13] sequence, raising the registei- value in this case. ''5 The derivations of the sandhi 
patterns of Type B disyllabic compounds are given as follows: 

(20) Derivations of Type B compounds: (Rules (17). (18). (19)) 





Metathesis 1 


L-raising 1 


R-spread 1 


Output 


13-13^ 


31-13 


31-35 


n/a 


31-35 


13-35 


n/a 


n/a 


13-13 


13-13 


13-53 


31-53 


n/a 


n/a 


31-53 


35-13 


35-31 


n/a 


35-53 


35-53 


35-35 


35-53 


n/a 




35-53 



Our findings regarding Pingyao tone sandhi may be summarized as the following 
two points: first, the process of register assimilation is strongly substantiated by the two 
t>'pes of Pingyao disyllabic compounds examined in this section. Yip's argument against 
register spread on the basis of the register neutralization rule proposed by Chen is 
untenable. Second, the great extent of similarity shared between the rules accoimtina for 
the Type A and the Type B tonal alternations does not appear to be a coincidence. It 
seems plausible that, for a language, there is a set of core phonological rules which may 
var\' to certain degree (e.g. the conditions under which the rules apply) to make distinct 
different categories or functions. This latter point is worth pursuing further and may find 
support in some other Mandarin dialects to which we are to return shortly. 

2.2. Changzhi. Like Pingyao. Changzhi is also a Mandarin dialect spoken in 
Shanxi of which the tone sandhi patterns have generated much debate. According to the 
documentations by Hou ( 1983), Changzhi has seven citation tones (i.e. [213], [24], [535], 
[44], [53]. [4]. and [54] as listed in (I)) and a number of different tone sandhi patterns 
which appear in different granmiatical functions. In the following, I give some examples 
of the suffixed forms and the verbal reduplicates: 
(2 1 ) a. Nominal suffix AE; tE?/: ' ^ 

(i) ts'E[213] [ts'E213-tE?213] 'car' 

(ii) lung [24] [lung 24 - tE? 24] wheel' 

(iii) i[535] [i 535 - tE? 535] 'chair' 

( iv) t6iO [44] [t6iO 44 - tE? 535] 'sedan-chair' 
(V) tEu[53] [tEu 53 - tE? 531 'pea' 

( vi) kuE? [4] [kuE? 4 - tE? 4] 'valley' 
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(vu)tsuE?[54] 


[tsuE? 54 - tE? 54] 


'bracelet' 


b. Adjectival suffix /ti/: 






(i) suang[213] 


[suang 213 - ti 213] 


'sour' 


(ii) xuang [24] 


[xuang 24 - ti 24] 


'yellow' 


(m) lEng [5351 


[lEng 535 - tl 535] 


cold 


(iv)ts'Eu[44] 


Its-Eu 44 - ti 535] 


'stinky' 


(V) la? [4] 


[la? 4 - ti 4] 


'spicy' 


(22) Verbal reduplicates: 






sang [213] 


[sang 213 - sang 35] 


'to fan' 


t6'iEu [24] 


[t6'iEu 24 - t6'iEu 53] 'to beg' 


ts'O [535] 


[ts'O 535 - ts'O 35] 


'to fry' 


k'ang [44] 


[k'ang 31 - k'ang 53] 


'to look' 


ung [53] 


[ung 35 - ung 53) 


'to ask' 



In Bao's 1990 analysis, he treats the suffixed forms (e.g. (21)) as cases of tonal 
spread; i.e., suffixes /tE?/ and /ti/ get their surface tonal realizations fiom the preceding 
syllables except in the case of [44]. "When the first syllable is [44], the following suffix 
surfaces with a [535] instead of a [44] tone. Bao attempts to explain away this exception 
by claiming that [44] is a default tone (i.e. an unmailced tone) in Changzhi, and that, since 
it is not specified, the underlying tone associated with the suffix surfaces. This analysis 
offered by Bao faces some difficulties: first, although Hou ( 1983) has stated cleaily in 
his field report that Ai/ has the underlying tone [535], it is not clear whether or how /tE?/ 
gets the same underlying tone [535], since checked syllables carry either a short [4] or a 
short [54] tone in Changzhi. Second, the two checked tones in Changzhi generally have 
the same tonal alternations as [44] and [53] and thus can be treated with the same 
underlying representations as [44] and [53]. Now if [44] is unspecified (i.e. the default 
value) and cannot spread onto the suffix, one must wonder why the same surface pattern 
does not arise when the short checked tone [4] is the first syllable (i.e.. compare (21a(vi) 
with (iv), and (21b(v) with (iv)). Treating the [44] case as an exception which needs 
some special stipulation, Duanmu ( 1990) suggests that the whole tone spread in Changzhi 
suffixed forms can be alternatively analyzed as tonal copying; i.e., the suffixes are 
considered toneless and get their tones by a tonal copying process which copies the tones 
of the preceding syllable. 

On this issue. Yip (1992) argues against Duanmu's tone copying analysis of tiie 
suffixed forms by pointing out the difference in tonal alternations between the suffixed 
forms and the verbal reduplicates (e.g. (22)). In Yip's view, since the tonal melodies in 
verbal reduplicates do not seem to be held fixed by a prosodic template, the reduplication 
process must copy the tones as well as the segments before tone sandhi subsequently 
occurs. She then questions that if both suffocation and verbal reduplication involve 
copying the tone from the first syllable, why the suffixed forms and the verbal 
reduplicates do not exhibit the same tonal alternations. Consequently, Yip concludes that 
suffocation (as in (21)) involves tonal spread, spreading the whole tone of the first 
syllable onto the suffix with no further sandhi, whereas verbal reduplication (as in (22)) 
involves tonal copying, followed by tone sandhi. However, another set of the tonal 
altematioas is noted by Yip. which occurs in disyllabic compounds with the same 
underlying (or citation) tones (examples listed in (23) below). Along the same line of 
reasoning, Yip is perplexed by the question why the patterns seen in (23) differ from 
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those in (22), since, in her view, after reduplication tbe base tones in (22) should be the 
same as the corresponding forms in (23) and presumably undergo the same sandhi 
processes. She freely admits that she has no explanataion for the different sandhi 
alternations seen here. 

(23) Disyllabic compounds with the same underlying tones: 

a. [kung 213 'male' - t6i 213 'chicken'] --> [213-53] 'rooster' 

b. [k'ae 213 'open' -ts'E 213 'car'] --> [35-213] 'to drive' 

c. [sE 24 'snake'- y 24 'fish'] -> [24-24] 'eel' 

d. [mu 535 'female' - ma 535 Tiorse'] -> [35-53] 'mare' 

e. [t6iO 535 'to wring' - liang 535 'face'] ->[35-535]'to scrape facial hair" 

f. [ts'Eu 44 'stinky' - fang 44 'charcoal'] -> [53-44] 'smoky charcoal' 

g. [t6iEu 53 'old' - fang 53 'cooked rice'] -> [53-53] 'leftover' 

So far no phonological analysis of tone sandhi in Changzhi can be considered 
satisfactory. Bao (1990: 126-34) has attempted to account for the suffixed forms as a 
case of tonal spread, and the tone sandhi in the verbal reduplicates by a set of rules, but 
has ignored cases such as (23). In Bao's analysis of the forms in (22), a number of 
category-specific and general rules are proposed, We first give Bao's analysis of the 
underlying representations of Changzhi tones in (24) and the rules proposed for the 
derivations of the surface tones in verbal reduplicates in (25), followed by some 
derivations based on Bao's proposal. 

(24) Bao's analysis of Changzhi tones (with shorthand modifications): 

[213] L [535] H [24] Ij[ [53] H 
hi hi 1 h 

(25) a. Default: [ ] -> [-stiff] (i.e. [ ] -> L) 

[ ] -> [-slack] (i.e. [ J -> h) 

b. Contour formation: 

I "-^^ 

[a slack] [-a slack] (i.e. h h — > h 1; 1 1— > h 1) 

c. Verbal reduplcation: (with informal simplifications) 

(i) h I — > jH^ 

1 h / ] 

(ii) cf )l 

X — > h 1 / ] 

(iii) Metathesis: 

A, 

(26) Derivations of some verbal reduplicates with Bao's rules: 

a. [213] (UR: L/h 1): 
redup. (25ci) (25b) 
[21 21] > [21 35] > [213 35] 

b. [53] (UR: H/h): 
redup. (25cu) (25b) (25ciii) 
[55 55] > [55 53] > [53 53] > [35 53] 



h 1 — > 1 h / H 

I 
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c. [44] (TJR: unspecified): 

(25a)def. redup. (25cu) (?.5b) 

> L . h > [33 33] > [33 53] > [31-53] 

According to Bao's analysis, the surface citation tones result from application of 
contour formation (i.e.(25b)) to the underlying forms in (24) and the application of 
default rules in the case of [44]. For the sandhi alternations in verbal reduplicates, 
category-specific rules (25ci, ii) account for the rising tone on the second sy Cable when 
the first syllable is [213] and [535], and the falling tone when preceded by [24]. f44] and 
[53]. Except for the sequence [53-53], where a metathesis rule (25ciii) is further needed 
to give the correct [35-53] pattern, contour formation (25b) accounts for the surface 
contours of the first syllables. 

With regard to Bao's arialysis, Duanmu ( 1990) has pointed out that there is no 
evidence that [44] is the default tone in Changzhi. Moreover, instead of being 
explanatory, Bao's category-specific rules (e.g. 25c i-iii) are merely formulated 
observations. To gain insight into the Changzhi tone sandhi, T .state a number of general 
observations regarding the tonal variations in Changzhi below. 

(27) General observations regarding Changzhi: 

a. No low initial: No tones in Changzhi start off with an extreme low 

pitch (e.g. L/1). i.e. * [ 

h. No high level final: No high level tone exists prepausally (or at word 
boundary). i.e. * H ] 

ii h 

c. Alternating contour tendency: In the sandhi patterns of disyllabic 
compounds, tonal alternations seem to be to a large extent motivated by a 
preference for alternating contours (i.e. falling (h 1) followed by rising 

(1 h) or vice versa). i.e. / \ / \ 

* x y X y 

d. Contour forming tendency: For underlying level tones, the surface tone 
shape seems to be to a large degree determined by the beginning pitch 
level of the following tone; i.e., a mid tone becomes rising if the following 
tone starts with a high pitch; a high tone becomes falling if the following 
tone starts with a low pitch, i.e. / i^^J or / ^'-.^/ 

1 1 h h h 1 
It seems most likely that these observed phenomena are all phonetically 
motivated. (27a) and (27b) are in accordance with acoustic properties of utterance- initial 
and utterance- final pitch contours (e.g. Maddieson 1978). I will consider them to be 
constraints in Changzhi and henceforth refer to them as constraint A (i.e. (27a)) and 
constraint B (i.e. (27b)). (27c) and (27d). on the other hand, help to maintain a steady 
alternating contour rhythm and perhaps facilitate ease in articulation. 

As for the underlying representations of the Changzhi tones, I consider Bao's 
stipulation of contour formation (i.e. (25b)) unverifiable and overly abstract, fn the 
analysis which I am to present in (28), I avoid this abstraction and treat the underlying 
tones to be basically similar to their surface forms. 
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(28) ReanalysLs of Changzhi tones: 

[213] L [535] H [24] H [44] L [53] H 
/ I \ / I \ / \ I I 

h 1 h h I h I h h h 

Instead of proposing a set of unmotivated rules, I consider verbal reduplication to 
be a tonal copying process which conforms to the conditions set by the template of verbal 
reduplication. 

(29) Template conditions for verbal reduplication: The first syllable bears 
a contour tone, while the second syllable must be in a high register. 

i.e. a I G2 

I \ I 

X y H 

( ?0 ) V erbal reduplication: 

a. Copy segments and tones from the first syllable. 

b. When more than one toneme is copied, treat the leftmost toneme as 
extratonal; associate the rest of the tonemes to the moras of the second 
syllable from left to right, in a one-to-one fashion. 

c. Apply constraint B and constraint A wherever applicable. 

d. Associate the inserted toneme required by the template for <5\ from the 
left, unless otherwise constrained. 

In this analysis, the surface tone patterns of the verbal reduplicates are the result 
of the interaction between the template and the tonal copying process of verbal 
reduplication, further coristrained by the two phonetic constraints in Changzhi (i.e. (27a) 
and (27b)). The deiivations are given below in (31): 

(31) Derivation of verbal reduplicates: ((29) + (30))-^ 

a. [213] (UR: L/h 1 h): 

(30a) L L (30b)+(29) L L K 

> / I \ > / I \ I \ — > [213-35] 

h 1 h h 1 h h 1 h (h) 1 h 

b. [24] (UR: H/lh): 

(30a) H H (30b) H H (27b) H H 

> / \ > /\ I > /\ / -> [24-53] 

1 h 1 h 1 h (Dh 1 h h 1 

c. [44] (UR: L/h): 

(30a) L L (29H-(30d) L H (27b) 1. H (27a) + (30d) L H 

— -> I > / \ I > / \ I > / \ / \ 

h h 1 h h 1 h h 1 h l(*lh)hl 

— -> [31-53] 

d. [53] (UR: H/h): 

(30a) H H (30dH-(29) H H (27b) H H 

> I > /\ I > /\ /\ — -> [35-53] 

h h 1 li h 1 h h \ 

As for the disyllabic compounds with the same underlying tones (i.e. cases in 
(23)). the tonal alternations involve no template but only tone sandhi rules, plus 
constraints (27a) and (27b). I wo rules may be formulated for the observations given in 
(27c) and (27d); that is. the tendency for alternating contour (27c) suggests a 
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dissimilatoiA' prf)cess on adjacent contours, while the tendency for contour fonnation 
(27d) points to a pitch assimilation process which creates a contour by allowing pitch 
spread from the following syllable. In addition to these rules, a register dissimilation 
process can be seen m (23a, b & f) in which two identical U)nes of low register aie in a 
sequence. In other words. (23a) and (23b) have undergone a contour alternation process 
along the lilies of (27c) together with a register dissimilation process, while (23f) has 
undergone a contour fonnation constrained by no- low- initial (i.e. (27a)). then followed by 
a register dissimilation. The register diijsimilation rule is giv en belov ; 

(?2) Register dissimilation in Changzhi disyllabic compounds: When two 
tones in L register are in a sequence, one of them must undergo register- 
raising. 2' i.e. L L — > H L 

or L H (with pitch change omitted) 
Finally we go back to the suffixed fonns of (21a (iv) and b(iv)), of which tlie tone 
pattern defies a straightforward spread from the first syllable. Although admittedly I do 
not have a perfect explanation for this case, I suspect the following process to be at 
work: after tonal copying, the tone on the suffix undergoes a register dissimilation, the 
output of which (i.e. a high level tone) then undergoes a contour fomiation due to 
constraint B (i.e. (27b)) in the language, eventually giving rise to a surface l!^35]. I 
further assume that the reason for (21a (vi) and b(v) not to go through the same processes 
is due to the "checked" tone involved: i.e.. since the tone copied is a reduced, short tone, 
it does not go through register assimilation nor meet the "high level tone" condition for 
constraint B to apply. Both hypotheses await further verifications. 

Despite tlie many aspects of Changzhi which remain perplexing, I have shown in 
this analysis that, contrary to Yip's 1992 claims, it is possible to consider both the 
sufftxed forms and the verbal reduplicates to have undergone tonal copying: the former 
mvolves full association of all tones copied, whereas the latter involves only partial 
association of the tones copied and a template for verbal reduplication. In addition, tone 
sandhi in disyllabic compoimds with identical underlying tones is tieated to be entirely 
different from the verbal reduplication process; i.e.. tonal alternations in disyllabic 
compounds (23) can be accounted for by a number of tone sandhi rules (e.g. register 
dissimilation and alternating contour), whereas tone sandhi of verbal reduplication is 
accounted for by an interaction of tonal copying and template, further cuiistrauied by (27b 
& a). Compared with previous analyses, this current analysis has been able to account 
for the most t\pes of tonal alt mations in Changzhi and to offer a more explanatory- 
account which shows the complex tone sandhi exhibited in Changzhi to result from the 
interaction of a number of prosodic constraints, conditions, and rules. 

3. Types of Tone Sandhi in Mandarin Dialects: Summary of Preliminary Findings 
3.1. Major types of processes. Despite the insufficient data and the preliminary 
namre. this smdy results in some interesting discoveries regarding the tone sandhi 
processes ui the Mandarin dialects. From the foregoing discussioas. it is quite obvious 
that dissimilation is the most powerful process in Mandarin tone sandhi. Following the 
practice of repiehenting tone by a register and a pitch feature tier (e.g. Yip 1980 and 
subsequent studies. Bao W90. Duanmu 1990, and Chang 1992). two major types of 
dissimilation may be categorized on the basis (;f the prosodic "level" (i.e. tier or 
dimension) at which the dissimilation cx:curs. 
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First at the register level, dissimilation is present in all four dialect groups. In the 
BF group, the well-known r3 sandhi in Beijing Mandarin Ls a cla.ssic example in which a 
L register is dissimilated to a H register when followed by another L tone. In Tianjin. a 
process comparable to the Beijing T3 sandhi is (2b); in addition, when two high falling 
tones are in a sequence, register dissimilation applies on the first syllable (as in (2c)). In 
the XxN dialect of Chengdu, the register value o( the final syllable is altered to be dilferent 
from that of the more stressed first syllable. In the JH dialect group, register 
dissimilation »xx;urs in Zhenjiang when two low falling tonei are m a sequence (i.e. 
(lOa)): in Hefei, the high checked tone is dissimilated to low when followed bv 
nonfalling high tones (i.e. (9e)). In the XB dialect group, both of the sandhi pnxesses 
observed in Xian (i.e. (II)) involve register dissimilation. Finally, register dissimilatiim 
is also observed in limited lexical cases in Pingyao (e.g. (19)) and in Changzhi (e.g. (32)) 
as we have examined in the preceding sections. 

Second, pitch dissimilation is also per\'asive across all dialect groups. Three 
ts'pes of pitch dissimilaton processes can be concluded for tlie dialects examined in this 
study: reassociation of the underlying pitch values, pitch insertion, and contour 
metathesis. 

Reassociation of the underlying pitch values is usually seen when the tone sandhi 
involves complex contour tones; examples are the T3 sandhi in Beijing and similar 
processes in Tianjin (2b), Xuzhou (3), Wuhan (7), and Hefei.(9a).-- Pitch insertion is 
found in Tianjin (i.e. rule (2a)). The third type of pitch dissimilation is contour 
metathesis, a very productive process in Mandarin dialects, particularly in those of the 
XB and JH subgroups. Similar to pitch reassociation. the process of contour inetatliesis 
involves the delinking of a pitch on the left plus a pitch spread from the right. In the JH 
group, metathesis is captured by (9c) in Hefei and ( 10a) in Zhenjiang. In the XB group, 
both classes of compounds examined in Pingyao require contour metathe&is (i.e."(13b) 
and (17)). Metathesis is also observed in the tendency for alternating contour (27c) in 
Changzhi and in the sequence of low tones in Xian ( 1 la). 

In sharj) contrast with dissimilation processes, assimilation in Mandarin has only 
limited application. 23 The best examples are those of register assimilation in Pingyao 
(i.e., (13c) and ( 18)). In addition, the contour forming tendency in Changzhi (27d) may 
also be considered as a minor assimilatory process. 

3.2. Tone sandhi tendencies related to stress, contour shape, and register. 
Due to the limited data at hand, we have been able to obser\'e <mly those more ob\ iou.s 
cases of stress-tone interaction, leaving open the large extent of this phenomenon for 
future research. Based on the available sources, the high level tone in Xuzhou 
reduplicates (in (5)) and the tonal alternations on the fmal syllables in Chengdu (6) and 
Kunming (8) are most likely to be due to weak stress. We may further speculate that 
what motivates the templates for verbal reduplicates in Changzhi (29) and for the 
triplicates in Xuzhou (4) and the contour simplifications in Zhenjiang ( I0b,c) and Hefei 
(9d) in the first place may be stress-related; however, this possibility must await further 
exporation and another forum to be discussed in. 

In terms of contour shape, the complex contour tone has a high tendency to 
undergo sandhi alternation, as evident in Beijing, Shenyang, Tianjin, Xuzhou, Wuhan, 
Changzhi, and Hefei examined above. Furthermore, there is a strong tendency for falling 
tone to undergo the metathesis when followed by another falling tone. With regard to 
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register dissimilation, the majority of cases involve a change of L register to H, although 
the reverse is also found in a couple of cases (i.e. Chengdu (6b) and Xian ( lib)). 

To sum up our findings, there is little doubt that the most general sandhi processes 
in Mandarin are register dissimilation, particularly the dissimilation of L registers, and the 
simplification of pitch contour, an alternation triggered by complex contour tones of a 
simplifying nature (e.g. reast,ociation of tlie imderlymg pitch values). In addition, 
metathesis is a productive process which is observed particularly in the XB and JH 
dialects. 

4. Theoretical Implications 

This preliminary study has presented a systematic account of the properties of 
Mandarin tone sandhi and offered a wider scope of study than that carried out by C. C. 
Cheng in 1966, one of the very few systematic studies of dialectal sandlii phenomena. 
Although much of the obser\'ation made by Cheng still holds true, the current smdy has 
further shed light on the great complexity in the XB dialects and, to some degree, the JH 
dialects (e.g. metathesis). 24 More i- iportantly. the tone sandhi properties discovered in 
this study are significant in oiir understanding of the nature of tone (i.e. the fumiai 
relation of tone to other phonological properties) and of the formal properties of a model 
of tonal representation. We discuss these implications in turn. 

4.1. The place of tone in phonology. From the current study, two general 
theoretical implications can be inferred. First, the predominantly dissimilatoiy nature of 
the tone sandhi rules suggests that tone as a phonological property' is more like stress (e.g. 
the dissimilator>' nature of "stress clash") than segments in nature. Second, the 
prevalence of contour metathesis suggests that at some level of pht>nological 
representation, the entire contour tone must be taken as a unit, since metathesis always 
involves the contour as a whole instead of just a single component in the contour.--"^ Both 
implications are crucial in determining the formal status of tone in the organi/atioii of 
phonology. The former raises the question whether a tonal representation should be more 
similar to that of stress than of "segments" (i.e. in the sense of SPE): in other words, the 
measure of representing tone simply under a tonal node (regardless of tlie natiTe of the 
node) in feature geometry rather than constructing a tonal structure along the line of 
metrical tree/grid is questionable. '^'^ On the other hand, the latter indicates that a higher 
level of representation, namely the syllable level, may be necessary in the representation 
of tone, in spite of the relative success of treating contour tones as sequences of tones, of 
which the tone-bearing unit is the mora (e.g. Duanmu 1990, Yip 1992). This latter 
implication lends full support to Chang (1992), in which tone is at the same time borne by 
the mora which is directly built on the slot anchoring the feature geometry with 
articulatory relevance and by a more dominant syllable level which tonal changes must 
observe. 

4.2. The accountability of the formal modal. In addition to aspects discussed 
above regarding the nature of tonal properties and the level of tonal representation in the 
overall phonological organization, there is ongoing discussion with regard to the power 
allowed for a model of tonal representation to predict possible types of processes (e.g. 
Yip 1989, 1992). The XB dialects of Pingyao and Changzhi examined in Section 2 offer 
some valuable insight into this Issue. Fii-st from Bao's ( 1990) analysis of the Type A (i.e. 
( l?c)), further strengthened by the present analysis of the Type B compounds in Pingyao 
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(i.e. ( 18)). it is clear that register spread (i.e. assimilation) is a major tonal process for 
which a tone model must provide account. The process of register spread forces the 
model to treat tonal register independent of tonal pitch so that as the register value 
assimilates, the pitch values remain unchanged (contrary to the prediction made by Yip's 
1989 model). Second from our discussion of Changzhi. we see that both of the Chanczhi 
suffixed forms (21) and verbal reduplicates (22) can be considered tt) have undergone a 
tonal copying process, and since reduplication involves a (segmental) copying process 
regardless, there is no need to insist on a process of "whole tone" spread for the 
derivations of reduplicates (cf. Yip 1992). In fact, there is so far no evidence that ' whole 
tone spread" cannot be equally replaced by a tonal copying procedure. 

5. Further Issues 

In this study 1 have attempted to look into a number of documented Mandarin 
dialects in order to better understand the nature of tone sandhi. Although a great deal has 
been revealed, our analyses of the dialects in question and knowledge of tone sandhi in 
general must be further amended by more detailed field investigation. As pointed out in 
3.2.. the likelihood of a prominetice-tone interaction needs to be seriously pursued both 
empiricaUy and theoretically. In addition, empirical questions can be raised as to why the 
observed tone sandiii tendencies occur the way they do, such as the register dissimilation 
which occui-s when two L-registered tones are contiguous aaid the contour metathesis 
when one falling tone is followed by another. I suspect that here both tendencies; may 
have phonetic bases; i.e., it is likely that L register dissimilation is due to peix:eptual 
caases, whereas the contour metathesis of falling tones is motivated by timing/duration 
concerns.-^ Finally as for the theory of tone, I have pointed out some important 
theoretical implications in Section 4; however, the exact organization of the phonological 
components involved for tonal representation (including tone features, tone- bearing units, 
and the prosodic structure) is still unsettled and demands the development of a more 
articulate, better integrated phonological theory. 



NOTES 

1 . There are. of coui '^e. exceptions to this statement. In Chinese languages, the 
well-known exception is Soutnem Min which basically involves widespread paradigmatic 
replacements of tone.s in grammatical contexts. 

2. The following four major dialect groups are generally recognized: Northern 
(Beifang guanhua; henceforth BF), Northwestern (Xibei guanhua; XB), Southwestern 
(Xinan guanhua; XN), and Eastern, i.e. the Yangtze and Huai Rivers region (Jianghuai 
guanhua; JH). For their geographical spans, see Norman (1988) and the summary in 
C"han2 (1992). 

3. The tonal categories are derived from historical development and used widely 
in field work to document the citation tones. The data of Xian. Chengdu and Wuhan are 
from "Yuan (i960). Hefei and Yangzhou are from Beijing Daxue's surveys (1964). 
Pingyao and Changzhi are from Hou (1980. F-83). Tianjin is from Li and I.iu (1985). 
Xuzhou is from Li ( 1985). Zhenjiang is from Zhang (1985). Note that the tonal categor>- 
5 which corf'sponds with the Qu-sheng category of Middle (Chinese is split into two 
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categories in Changzhi. Tone categories 7 and 8 are the Ru-sheng (checked syllables), of 
which the tonal duration is relatively abrupt and short. 

4. We ignore the domain of the sandhi rule application here for conciseness and 
also for the convenience of C(;mparison. Beijing T3 sandhi is treated as a register 
dissimilation rule which changes the low register ([21]) to a high register ([33]) when 
followed by a low-registered T3 (Chang 1992). 

5. Again, for conciseness, the discussion with regard to directionality of tonal 
domains and extrinsic ordering among these rules are bypassed here. I refer the 
interested readers to the Chen (1986) and the articles in the special feature of a 
symposium on Tianjin tone sandhi in the Journal of Chinese Linguistics 15.2. 

6. The notation used here in the rule formations is highly abbreviated and 
informal. H/l. and h/1 are used as "shorthand" device to indicate the upper and lower 
register values and the pitch distinctions of a tone; the analysis follows the Une of tone 
feature analysis originated by Yip ( 1980). 

7. This is somewhat reminiscent of many reduplicated forms in Beijing Mandarin 
in which the second reduplicated syllable or both syllables surface with high level tones 
and are considered as prosodically weak positions. 

8. One may saspect that the template originates from the triplication of the 
complex tone [213]; moreover, the tonal pattern of (4a) indicates a re- reduplication 
process from the right. Both may be supported by evidence from other Chinese dialects. 
The latter Is supported by the well-known triplication cases in Taiwanese, whereas the 
former by the Mandarin spoken in Taiwan, in which there exists a reduplication template 
[21-35] in baby talk, originating perhaps in the spreading of the complex contour [214] 
over the disyllables. Due to its complex contour melody and longer duration, a complex 
tone may be more susceptible than other tones to split up, thus creating a template. 

9. The examples here are given in Pinyin. Li (1985) uses Chinese characters to 
docimient these examples. 

10. I am indebted to Professor Wang Hongjun (p.c.) at Beijing University for 
providing the updated information of the Lexical Survey. 

11. For conciseness in exposition, Bao's rules are rewritten here with "shorthand" 
notations to avoid the complication of the formalism adopted by Bao, which bears no 
relevance to our discussion here. 

12. In formalizing his observation, Chen follows Yip's 1989 model which takes 
register to be the tonal root node. The formalism used and the discassion of the formal 
problems with this rule are omitted here. For details, see Chang (1992). 

13. Type A consists of compounds whose components form verb-object or 
subject-predicate relationships, whereas Type B consists of those whose components 
form relationships such as adjunct-argument, argument-complement, conjunct (parallel) 
structure, suffixed noun, reduplicated noun, and reduplicated adjectives (see (49) below 
for examples). Type C contains reduplicated verbs. All three types exhibit different 
sandhi patterns from one another. For conciseness, we limit our discussion to the Type B 
patterns here. 

14. Except for a minor difference is noticed between Type B [25-53] and its 
corresponding Type A pattern [35-423], The difference may nevertheless be considered a 
phonetic detail bearbig little significance. 
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15. For the sake of examining whether register neutralization exists in Type B. T 
hypothesize that the register value ot a rising tone is positively correlated with the pitch 
value of the preceding syllable; i.e., replace the rising tone syllable with a H register value 
if the preceding tone ends with h, and vice versa. Ill-formed derivations arise when this 
hypothetical process is applied either followed or preceded by metathesis (17). For a 
detailed exposition, see Chang (1992). 

16. There is a difference in the order in which the regi.ster dLssimilation nile 
applies in these two types of compounds. The register- lowering rule applies before 
metathesis in Type A. whereas L-raising applies after metathesis but before register 
assimilation in Type B. 

17. The fact that the two complex contour tones in Changzhi, [213] and [53?], 
may appear in the first syllable of a disyllabic string is perplexing. It is not clear whether 
the contours are fully realized phonetically and, if so, whether the syllables bearing them 
are lengthened. Moreover, [213] and [535] contours are documented for the checked 
suffix /tE?/ by Hou. Again, the phonetic detail is not available. One may speculate that 
since, according to Hou, the nominal suffix often takes two forms /tE/ and /tE?/, no 
glottal ending appears when the suffix bears complex contours. These phonetic details 
await further verification. 

18. Yip's rationale for rejecting tonal copying in (21) and for questioning the 
differences in tone sandhi between (22) and (23) is ill-conceived. Even if a copying 
process gives rise to the same base forms, it does not automatically follow that forms 
from different morphological categories are to undergo the same sandhi. According to 
the phonological theory conceptualized in lexical phonology, phonological rules have 
access to lexical/morphological information. 

19. As indicated by f25c) below, Bao must also assume a tonal copying process 
in reduplication; however, he did not make clear how the reduplication process copie<; 
tones. 

20. Segmental copying is omitted for conciseness. 

21. From the limited data given in Hou (1983). it seems that L register 
dissimilation occurs most regularly when the complex tone [213] is involved. Two 
patterns are given for the sequence of [213]: [213-53] and [35-213]. It is not entirely 
clear which pattern is preferreid. 

22. In the case of Tianjin, if the tone documented as [213] should be treated as 
[13] as some suggest (e.g. Shi (1990)), then (2b) involves only register dissimUation. 

23. This statement, of course, may be argued otherwise because it depends on 
how assimilation and dissimilation are defined. For instance, if one treats the pitch 
spread from the left in a metathesis as pitch assimilation, then it would seem that the 
assimilation processes in Mandarin should also include metathesis and thus are not so 
limited. However, here in my analysis, although contour tones are taken analytically as 
sequences of level tones, Uie entire tonal contour is taken into account as a unit: therefore, 
a change from falling to rising contour is taken to be a dissimilatory process. 

24. In Cheng's study (1966), data from only four dialects were available for 
analysis. On the basis of Xian, Shenyang, Chengdu and Beijing, he reaches the 
following conclusions: (1) when two low tones are in a sequence, the first one becomes 
high rising; (2) when two high falling tones aie in a sequence, the first one becomes low 
falling; (3) except for Chengdu, the sandhi occurs on the first syllable. In the current 
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study based on a wider source of data, we see that all three points are too simplified to be 
taken as entirely correct. 

25. In addition, the sandhi environments in the JH dialects of Hefei (i.e. (9d & e)) 
and Zhenjiang (i.e. ( lOb, c. & d)) also indicate the formation of a natural class based on 
the shape of the contour (e.g. falling). 

26. In addition, if a tone model represents tone only by laryngeal featuies under 
the relevant order of feature geometry, then it would not predict and naturally provide an 
account for the interaction between tone and stress which is represented by metrical 
structure built upon the skeletal tier (Liberman and Prince 1977 and subsequent works of 
Prince). However, such prosodic interaction has been obser\'ed in Chang (forthcoming), 
and therefore suggests a more complex view with regard to tonal representation. 

27. The tonal structure consisting of a syllable level is able to explain such 
syllable-observing processes such as metathesis. It also accounts for the phenomena 
obser\'ed in speech errors, such as the anticipation and perseveration of the entire contour 
tone observed in Tliai by Gandour ( 1976). 

28. Presumably L-registered tones tend to have less percepmal saliency than H 
tones, and falling tones tend to have relatively shorter duration than that of other full 
tones in Chinese. However, this awaits further verification. 
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l§S^-^-aSlEgMli§ilJ:9-^^llsIli§ ( tone language ) i^SlS^lls ( 
accent language ) o ^^Msf^mX^IM^^^MMa ( stress-accent ) , $n : 
( pitch-accent) , Pa - Bllo $D]I:k:^m^^^^^^MSI^M 

mWMmmmmmmB ( nontone language ) ^ilJ:^SM5ig afi^ii^i^±fiUmS 



t::scpnmm^ ( accent ) ^-*M:^wtiif>ig?£. tm&. mm'^mmmmo m 



fmrnmrnm^n^ - -) mm^mM.tm^ (more prominent) . ^^11 

mtmM^, mtmmmMmM^i s) m^^'^mm^mmm ( tonai spread 

^ ^yjB. myjB. mt. ^^^wmtob. n^itm^ 

1.0 mwmmmsm 

1.1 {mtimmmm'}^ 



^-mm'fmmm [55-31] wm. idm^m^m:^^m-m^^mmo 
mm zhang(i992) mm. mrxmm, r^is^wwarMmn, i^^m [55-31] 



S^ nyjmm^-{9.'^WM (Hashimoto 1973. Yue-Hashimoto 1987) . f^;? 

mm'MmiMm.^mm mmm mo ) . mmifms^mmmnmbmmm^wmo 

imBM^n. 1±UWM (1960) , Yue-Hashimoto (1987), Norman (1988) 
WlRm^^l5m'^-^m^1±m^=nm'^Wik-m=nm (Yue-Hashimoto 1987) o itlS 

(1985) n^m^ii^ mtJimmi^iKyjm) , mmmm^wMmo 



CoooO] # CoooO] # CoooO] # 

1.3 l!i?^^i?5)^ifs: 

1.3.1 mmm 

im'm\mmm--mmtimnmm. mmmnm^ (o. ) 



1.2 WM 
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mmrt-mmmmo iiu-iv-mijb mmyjs) ^x^j (o. ) r^j n^, 

tmMi^-mi^^^mm. mnM'^^immAmMwmm (trochee) o 
1.3.2 mnw^ 

imiyjB (^) mmm la mmARxmi't^mmmm , mmmm-^mu^m 

TuM'^mt, m^um'imm^^Bm'^^^it (un- stressed vowel reduction) 

J mm ; mm^iE^mwmmm, m ^m^j (oo) mmmn. mm 
tfi^tj . m rj^^j (0. ) mmmmn, mm m^mmmtj o mt:^^^. ^ 

5S]ffil!??^6{@SiJai5l[lt^S (lexically determined) . Silb^l^ilj^lJsBti. bES 
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1.4 mm 



mmM^mmm^mm^-mmi ( tonai spread ) o tmmB.ihmr±^:^s 
mmyj^o Yue-Hashimoto (1987) mBm&^yjmKm-^m^^^m'^mm^i'im^ 

1 -m^mmmmm'^^i^mt 













u 


b> 












c> 








r h 
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kheh^l? 












long^ 














r 1--^ 












k 1' 
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— 1 /A 



1 mmmtj:.!:\r\mwmmto (?n:s. np. ^?z'i!^w^mm9fno 

3 rEJLI^u^g. g5m«^lSl^lXau'§ls]^lt.^®'»o &u'S^i^'S'1?iniii$> Pain 



la 

b NP 



HP HP 



S 

VP 



NP 



S 

VP 



NP 



s 

VP 



ck:^W, 14*1. 
dOoO OoO OoO OoO 



S S 
HP VP VP 

OoO o o O 



la 

b NP NP 



d oOoO 
dd 



S 
VP 



o o o 0 
oOoO 



S S 
NP VP VP FP 

0 o o O . o o 0 . 



1.0 ^mmn^m^'^i 



mm 



;.i tMm^m^- 



la 'a^ 
b itPst^ o . 

c ^mmi^ o 0 



Mm mw- mm 

O. 0. 0. 

00 00 00 



mm^^r-m^. m ^m^. mm^ aa> o A^mm^mmm^^ ^- 
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0. 0 
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c ^"i-ait^^S 
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00 


000 


0 00 


00 


00 


000 


2a ^i§^iTl 






H-T 




rsm 


chhoef? 




h 

0 (U uo 


0. 


oO. 


0. . 


0. . 


0. 


0. 


0. . 



m^mm''^, mrmmi^mmm, m^^mmmm':}±r-m^^ (cheng im o 

4.0 ^mx±m^^Bm^':}mmm 

fill (transfer) 5IJ^itBI§flo 

4.1 Btmm'^mm 

Bmtmmmmtmmi^i'^mm^.m^ (pitch accent) o m.^mmi^-mm 
-Mj^±fi^wifi (mora) mm, ^Mmm^m^mu^mmo Ki£Wi«±w-s 

mkmm-m^Bo mm^r-m^m^m, i±Bm-^m o , i , ■ 2 . 3 , 4 . 
5,6. -m^if-, %rmm^^M'^m^, mmm^BmrnmBio ^^mm^m 

w-Mt, mtm^mBmm] -1 , -2 , -3 , -4 . -5 , -6 . "-^mim 

4.2 ^^MBtmBsMmwrniin 

mminBiS vdr.] , rmmp.Bimt^mB5^:^rm^tmmbno utr^-.i^ 



mm 






m 










• 


• 0 


• 


0 


• • 0 


• • O 0 


• • • • 0 




0 


• 


• • 




• • 


• • 


• • 


BIM hiftga 


toriftga 


sakuraftga 


toraodatiftga 


nihoN gamiftga 


murasaki irott ga 


-2/-1 0 


B»;6? 


0 




0 


^^M^w 0 


B4^5itt/0^ 0 


^Et*56» 0 


-2 01 




^b«;6? 01 




01 




75ih>^t}/0> 01 


-r— 01 


-2 02 




02 




02 


m^t'' 02 


C>-t)x)^PKlttx)^ 02 


^l^S)l^/Q»?ftz)^ 02 


-2 03 








03 


1P>0^ 03 


#^^ftz)^ 03 


TZTZ^iohX'^t)'' 03 


-2 04 










i^f)^ 04 


^^j:i^i:L' 04 


^i^b^l^«t^ 04 


-2 05 












05 


05 


-2 06 














:^?$^ft;6? 06 
















reN gou koku ttga 



-2 
-2 



-3 
-3 



03 
0 
O 



yuukiti ga 
MM^t^ 03 
#l]nft;^> 0 



i mo L. 



02 
02 
O 

• 

tottga 
01 



• • O 
• • • 

aNnaisyo ttga 
mm^ii^ 01 



02 
O 

• • • 

saNkasya kai hittga 



O 
0 



mp.Bmmwmm oi-oe MP.Bmmm'^im] 



-1--3 ^?^05§?M6tjs^f6i5B mmmmnmmm 

4.3 ^mutmm'^mh 
-^ m-^Bsm^mm^^ 

3. ffi#iat§i5.^i5ti1rfn]ff3J (accent shift) : .,a. -g^f 

b. mmmmlfo 
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1. m^l^T^mm.m^^m-^'iii (penultimate mora) o ii^l" -2 " 

2. imiJa^.^rmim:^-', ^Xm.mriM^ufm'^^^l^ (antepenult) o 

a. inmn^m^-mm^m^'^m'^m^mmmo 

b. mkmu,mmr.im\^mmp^QtM'^M^M^ ( voiceiess vowei ) m 

^m, m'^iimMmu^i^^^.:tmtmm. mmmmm^m, -3 m 

1. -{Hig^MF^^-MS^o 

2. 5g^Mmil?5mi4fi^lGH^ (HP. VP. S) .iJS-i^: HI§fi^lM?£mte:^& 

"^mmm-immr.fmmm.L B:^mwmmm^mm^^ 1957 -3ew 

/{fi-i}/ /{finti^}/ /{^-i} {{MS-,} 

r r 1- r > r 1^ r 

w^&mw inwm-jit, s^rnu ij^b^stt ti^eiiB» 

-f r \ h >-f \ I- ' 
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^" (1987 ¥Lij^'^fia3 ) 'Bf^m\ti^>^'mmrK.^^o ] 

nmmmm^m ( o ) mmo mm^Jtmmm'^m'^mA " oi " 
, ^nm'^mA " 02. 03. 04.... " m^o 

4.4 ^mBtm^mm-m^^ 

4.4.1 ffiifwfi^5bi^. ff^icse^ia 

mmmm^'^mUo 1± NP case marker. i^S. VP iJj S :^mm 

mm ( conjunction ) *^SB*i§fl^SSg#BUffi6rftlgi§. l£^f40lMfl*S$ 

NP. VP. s wMm^^^, mmmmMm':}^m^mmm^immmmm5\m^ 

(enclitic. =1.=2 . IljRtfK^. felt^^^^MlB6^5§^^a^^o itllillt.tS 

mwm^mmi>^o ^p^iifi : bat ^iiii? j e^i r^^j « r^j [ s« ^ » 



DTWLl : HPit CJ HPit ai^lti^tt ;a iSJ^-^tl la -2tt -2lt 



NP 



I 1 1 

NP CJ/K NP 
man and woman 




that man 's chi Id 
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m? cj » NP 



!)JPLl : 
(S^M^0,01..06^) 



LHIl L LIIH 
LHHIIH H LHH 



la » 02tf NA» 
b 

2a » 0 » NAtf 
b 



3)TyLl JPL2 : ftNPit CJ NP« 



LHL L HLL 
LHL H HLL 



la » -2=1 11 -3 « 
b 

la » -2tfl « -3 
b 



L HHHL L LHL 
L HHHL H LHL 

4.4.2 ^mmmn^nfnm^^m^mtimmmmmm 



2a » -2=1 « -2 « 
b 

2a « -2«1 « -2 » 
b 



Bl§ Jpn imm) Tw (©|§) ^-i^^giiTw-Jpn {^Mt) 



m^mim^ ^mnmmmsi ^mmmmm^ mmm 



5§W-p|||f£6^iB-^ {«NP^K} 



{K+NP«} 



{«NP»K»}{»NPtf=K»} 



sgtfjEfi^NP^^ m.'imm?^'^ -^mtL^^mmm 
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5.0 mmMm^mmwMi^m'^m^'mmmmim 

^mmi.mmmn np. vp. s sB^6^^^MM^-5?»tSii^ffnB^j%^ mm 



NP VP 

la ^tP4^ : [^^ mim] (*33333) 23 3 23 > 223 23 > 22223 (HH) 

2a [?@ W<m (*333) 3 23 > 223 

lb : M$f?S > M«« » ^ (-ai) 

2b m ^ « -f^ (-a) 

3 toriga . naku > toriganaku Bfff* tehl^L 

4 tooi . yama > too'iyama j^j^tt fi^ll! 

- -n > 1 

5 higa . otiru > higaotiru B0fi» f^lij 



--I - > 



■I 
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6 hana . saku > hanasaku Ibtl tehDtl 

7 nagaroru . ini.:u > nagarerumizu tohiiiCrt fi^7K 
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~i -1 > 

haruga .kita > 



harugakita 



6.0 mnmm^jii^m 



, ^mmmmmmo mmmmm^mmmi ( ni. wa. ga. o). -^-^mmm^ 



mm, 



6.1 r4mii*^iiii 



A *alil«i^ 



B 



on JR. m 



mm. 
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m-mm mm) mmm imm) . mm&ummmm^^m 

{'imHf^ 1970, Pulloyblank 1978 . W-di^Alii 1975 ^i^A 199J o 
*^it im-'^ 1983 ) o 
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it-p^^ xu-x §i§ 
MMM fiaSM xum "atmm mmm 



la. UllllSf^ 
b. llH^^-^^b [liH] - 

2a. M)lSW.^^.^fi.€f1^^ - 



+ 
+ 



+ 
+ 



+ 
+ 



+ 
+ 



+ 

+ 



+ 
+ 



[tin] mm\i'a%mm^mMmmmm^mmm'^. \^^mmm.^ 
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1 > 



mmmm 



VS7 





^MWM, mxmmm. --^timm, mmwmmimm^o ^^mbbmrn^m. 

Mmm^^jMt^, mmer mo, Hsieh 1990, Chang.C 1991] 



mmmM 



2. mmmm - m^PtvMm 

3. mmmm - mmm^mfnt 
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M 1 



1* ixmm^M'^s.mmm 

5- it^^llgS^WlEffifi^i^l^ 



1- {U 

3- {'^mmm 

4- {t^-JI^^^J 

5- it^'tfagSi 



B. 

c. ig^iniai^ 



D. n 



5- 
2- 
2- 

2+.3+ 



l-,2-,3- 

5+ 

!♦ 

5* 



A. -X^¥kW. 

C. ^55llMISI? 



i^Mem^it 2- 
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,S4 



4* 
4+ 



EH-?- a 



la ^^mms ? 

2a Slf^ffn^igm 



4a m^P.^M&.l 
b 



-m 



nop JCcir 



+ 
+ 



+ 
+ 



+ 
+ 



O ocj 



- dT + 

- dT + dJ 



5a 



7 lg§t^l$.^fi;^5^ > 



+ 

+ 
+ 

+ 



- dT 

- dT - dJ 



8a /tlH^I^IE'^ 

b mmmm^ 



♦ 



+ 



+ 



- dP - dJ 



±^dTi<cii^Dt^d§^fni. d?{mmm^r-m. A^\^mm-^.Bm^u. 



7.0 ASu^ 
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mmmm.^i{^^, ^^nm^'^m^ {wim\m\tL, mmmM.'^m 

m.^, z-mm^^fMmm, tM^tf^ifeiii-Ms. yu-m^. %m. mm. 
n, ^mmm.'m^'m^^mmm^o ^W:^-^B^mw^%m ( accent system) . 12 



^•m- 1985 WlhUt^^^o iB^lhimo 

1991 mm'/i^kTimm (-) o y^it^-m, gs-soHo 

nuM 1985 fMiij-^ff'^^.o traARaiii5?io 

1985 m\\yj-^:^^o ifiX'Mm. 

1988 m^Tia ' ±m\tmitmrv.o mmki^ihrnh 
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mn<^Jl 1089 ScSTj^r^io SmBf^^io 

WcSM 1982 mi\{7j^^d^sm-/j, Lq]llXK;ill!C?.to 

mm 1090 \imTj-M:^.. mrik^^thimt. 

1975 m^'u^^mm'iHi'mimo ^^mm^, 3.i:3-i5o 

3E 1993 ^j5-^-d^fi^!r.f]^-Mt^P±o «i'i1f4^^R)> rT23^m2llJj 175-192M 
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Introduction . 

Psycholinguistic studies of the spontaneous gestures that accompany 
speech, summarized in McNeill (1992), have shown that the study of 
gesture is a productive approach to understanding the cognitive processes 
that underlie speech production. These studies support the thesis that 
gesture emerges directly from the same underlying unit of thought as 
speech, without undergoing the linear segmentation required by the 
socially-constituted linguistic code. Analytic study of speech-accompanying 
gesture leads to the view that conceptual representations may be 
idiosyncratic and holistic in the same way gestures are. Further, gesture 
analysis yields a view of the conceptual representation underlying linguistic 
form that complements, enlarges upon and can in some particulars 
contradict a view based on speech alone. As such, the study of gesture 
offers significant clues to the nature of speakers' underlying conceptual 
representations as they evolve during speech production. 

This research has shown that speakers of languages as diverse as 
English, Japanese, Georgian, Hebrew, Turkana, Mandarin Chinese and 
others spontaneously gesticulate when speaking. Speakers of these 
languages perform highly similar gestures that combine with speech in the 
same ways for the same referential content. Further, such gesture has been 
found to function similarly across these languages in important ways. It 
serves, together with speech, to propel communication forward by marking 
those elements in evolving conceptual representations that most contrast 
with preceding elements. Speakers of all languages use gesture iconically 
and metaphorically. In addition, all speakers use gestural space cohesively 
to "map", geometrically, the relations among concrete referents and 
abstractions that play roles in the speakers' conceptual representations. 

Traditionally, the vocal speech stream has been the focus of studies 
of human communication. This linear, segmented, sequential linguistic 
code has been the primary (sometimes the only) input to the development 
of models of the human communicative competence that supports language 
comprehension and language production. Yet human communicative 
exchanges are richly textured, structured, contextualized and evolving 
events. The speech stream alone, because of its conventionalized and 
selective character, necessarily does not reflect many features of whole 
communicative events. Models of human communicative competence must 
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thus capture patterning at other levels of linguistic and conceptual analysis 
if they are to adequately describe the way communicative events evolve. 
The studies described here have involved detailed analyses of speech and 
gesture and the close relationship between the two. This research permits 
an expanded, richer view of communicative events by shedding the limiting 
focus on the sequential speech stream. 

Gesture is "so intimatsly integrated into the production of the spoken 
utterance that its planning and organization must proceed simultaneously 
with, if not in advance of, the production of speech itself" (Kendon, 1980). 
In fact, studies show that gestures almost always precede their co-referring 
speech, sometimes having their onset during the prior clause. Gestures 
together with speech reveal the nature and location of the "growth points" 
of utterances (McNeill, 1992). Similar to Vygotsky's (1986) 
"psychological predicate", the growth point is the minimal unit of verbal 
thought that retains the essential properties of the linguistic and imagistic 
whole of a thought. McNeill has shown that growth points contain the 
"new" information that the speaker will add to discourse: material that 
contrasts with what has gone before and that will form the point of 
departure for the next bundle of information to be conveyed. Linear, 
segmented, linguistic code is generated through a process of "unpacking" 
the material contained in these holistic growth points. According to 
McNeill, gestures embody these growth points and so may be used to 
identify and investigate them. 

Bates and MacWhinney (1991) note that a model of speech 
production must "account for the process by which native speakers select a 
set of expressions to convey meaning" in the context of real-time language 
use, and that the model "must have crosslinguistic generality." A useful 
way to think about the issues raised by the analytic study of gesture is in 
terms of Slobin's (1987) proposed dimension of "thinking for speaking." 
This is the structuring of thought into forms appropriate for linguistic 
organization. According to Slobin, "Thinking for speaking' involves 
picking those characteristics that (a) fit some conceptualization of [an] event 
and (b) are readily encodable in the language" (1987: 435). Analysis of 
gesture is a significant tool with which to study how conceptual 
representations evolve and how they drive speech production. It provides 
direct evidence about cognitive processes as they relate to speech 
production in real time. 

Comparative cross-language research on gesture examines the 
interface between conceptualization and language and permits us to 
distinguish what is universal in the content and structure of conceptual 
representations from what is influenced by grammatical and typological 
variations in language form. Typological features of different languages 
affect the ordering of reference and action in an utterance and languages 
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differ in how readily they encode certain features of a representation. The 
gestural component of a speech-gesture unit, however, reveals a view of 
conceptualization not filtered through these selective linguistic forms. 

The research presented here explores the nature of the relationship 
between conceptual representations and linguistic structure by comparing 
the speech and gesture of native speakers of typologically different 
languages from two unrelated language families: Mandarin Chinese and 
English. The speech and gesture of these two groups of speakers are 
compared in relation to two areas of difference in linguistic form between 
the two languages. 

Study I examines the spoken and gestural expression of verb aspect. 
Chinese is described as a "classic tenseless language" (Binnick, 1991) but it 
marks a range of aspectual distinctions grammatically and periphrastically. 
English marks both tense and aspect, but the two systems are confounded. 
Also, the expression of aspect in spoken English depends much more on 
periphrasis and the lexicon than it does in Chinese, which has a set of 
grammatical morphemes marking the major aspectual distinctions. Study I 
focuses on three common aspectual distinctions that are each marked by a 
single-syllable morpheme in Chinese: progressive ZAI, durative -ZHE and 
perfective -LE and examines the gestures that co-occur with these markers 
and their analogs in English. 

Study II examines the linguistic typological feature of Topic 
Prominence. Chinese, a "topic prominent" language, differs from a 
language like English, which is "subject prominent" (Li and Thompson, 
1976). As such, Chinese has several very frequently used structures that 
alter the ordering of reference and action at the sentence level during real- 
time speech production. The gestures of Chinese and English speakers are 
examined for clues as to the impact on the level of conceptual 
representation of such differing patterns of information flow. 

METHODS 

Subjects. 

The subjects are ten native Chinese speakers (two male and eight 
female) and ten native English speakers (two male and eight female), all 
adults. Of the Chinese subjects, six are from different regions of Mainland 
China and four are from Taiwan. The the data from the six Mainland 
subjects were collected shortly after the subjects came to the United States 
and all of these subjects were either university students or spouses of 
students. The four Taiwanese subjects were filmed in Taipei, Taiwan as 
part of a separate research project. All of the latter were middle class 
homemakers with small children. The present research assumes that in 
regard to the grammatical feature of aspect marking and the typological 
feature of Topic Prominence, all of the subjects are linguistically 
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comparable, even though some of them v^ere first language speakers of 
some Chinese language other than Mandarin. 

The English speakers were all members of the academic community 
at the University of Chicago. 

Stimuli . 

Two different types of stimuli were used to elicit speech: 

1) Cartoon: This is an American cartoon of a classic and well- 
known (to Americans) type. It features a cat and a bird and is about 5 
mmutes in length. It is limited in plot and consists of eight highly action- 
onented episodes in which "Sylvester" (the cat) attempts to capture and eat 

Tweety (the bird). Even though it is American, there is in fact very little 
English language in the cartoon. It has been shown to speakers of many 
different languages who have little or no knowledge of English and 
comprehension of the events and the story line is not impeded by this lack 
of language knowledge. 

2) Vignettes: These are a series of very short action sequences 
involving small characters or inanimate objects. A set of 40 vignettes was 
used for this study. Each is about 1.5 seconds long and depicts a character 
or object performing one or more actions. The vignettes are part of a test 
battery designed to elicit morphological marking on verbs of motion in 
signers of Amencan Sign Language (Supalla, Newport, Singleton, Supalla 
Met ay & Coulter, in press). The vignettes are designed to be viewed 
singly in a senes, with time after each one for the subject to describe what 
she has just seen. 

Procedure 

Elicoation. Subjects viewed the stimuli on videotape, and were 
then themselves videotaped describing what they had seen to a same-native- 
anguage interlocutor. In the case of the cartoon stimuli, subjects narrated 
to interlocutors who had not previously seen the cartoon. The subjects 
were instructed to be as complete as possible in their descriptions, so that 
the interlocutor would then be able to re-tell the story to a third person 
i welve of the subjects, six American and six Chinese, viewed the cartoon- 
the other eight subjects, four Americans and the four Taiwanese Chinese ' 
viewed the vignettes. 

All of the vignettes subjects, Chinese and American alike, found the 
vignettes stimuli somewhat bizarre, but generated a response to each 
vignette nonetheless. And of the subjects who viewed the cartoon, although 
Chinese subjects were much less familiar with the characters and 
conventions of American animated cartoons than were the American 
subjects, their narratives are quite comparable to the English ones overall 
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Speech transcription and oesture coding . From the videotaped 
responses, detailed transcriptions of all utterances, and the gestures that 
accompanied them were created. The speech for the Chinese cartoon and 
movie narrations was transcribed by a native speaker and checked by a 
fluent non-native speaker of the language. The vignettes data were 
transcribed initially by a fluent non-native speaker and checked for 
accuracy by a native speaker. The speech transcriptions include all pauses, 
breaths, and speech dysfluencies. The speech was coded for grammatical 
and narrative structure. The gestures were examined frame- by-frame on 
the videotapes and were coded for form, semantic content, and function in 
relation to the speech. Representational gestures were flagged. Finally, the 
timing of the gestures relative to the speech was exactly coded. All 
gesture coding was evaluated for accuracy by a second trained gesture 
coder. 

The Gesture Data . The gestures that provided the basis for 
analysis in this study were all "representational" gestures; that is, gestures 
that iconically represent some physical entity, or metaphorically represent 
an abstraction. Other gesture types did not figure very significantly in the 
analysis. Following McNeill (1992), the gestures are taken to consist of 
three primary phases: 1) preparation, when the gesturing hand moves 
from a rpst position, 2) stroke, the main representational movement, and 
3) retraction, when the hand returns to rest. All representational gestures 
are found to possess at least a stroke phase. Either of the other two phases 
may be overidden. The stroke is the crucial phase. Kinesically, it's the 
movement focus of the gesture; semantically, it is where the meaning of the 
gesture is revealed. 

This following excerpt from an English-speaking subject is of an 
utterance accompanied by a spontaneous gesture. The speech with which 
the gesture coincides is bracketed and the stroke phase is bolded. 

I dunno, {she slugs himl or throws him out the window. 

In the bracketed gesture, the preparation phase coincides with the word 
"she." The stroke phase consists of a single, rapid hand movement of the 
hand starting at the right and moving slightly downward to the left, an 
iconic gesture representing the action of slugging.. The retraction phase is 
on the word "him." 

In addition, gestural holds often occur during the course of a 
gesture. This is when the gesturing hand pauses momentarily in its motion 
and then resumes motion. The hand may pause between the preparation 
and stroke phases or between the stroke and the retraction phases of a 
gesture, or both, as the following example demonstrates. 



183- 



ERiC 



tJMmmmmm 



|... swings smac k into the bj |uilding 



In the bracketed gesture here, the underlined portions represent pie-stroke 
and post-stroke holds. The gesture holds for an instant coincident with the 
word "swings" before arcing downward across the gesture space in front of 
the speaker and to the left, an iconic gesture representing the cat swinging 
on a rope across the space between two buildings. There is another 
gestural hold before beginning the stroke phase of the next gesture. Kita 
(1991) determined that such holds are often a phenomenon of the timing 
relationship between speech and gesture. That is, because speech 
production is a temporally extended, linear, sequential unpacking process, 
whereas gesture is executable as a synthetic whole, gesture planning and 
execution usually precedes speech production. Thus, the preparation phase 
of a gesture will often begin some time before the co-referring portion of 
the spoken utterance. In such cases, the gesture may freeze in mid-motion, 
resuming only when the speaker/gesturer has come to the relevant portion 
of speech in the speech stream, at which point the semantically rich stroke 
phase executes concurrently with its co-referring speech form. On the 
basis of the data from this study I will suggest additional roles for gestural 
holds that are specifically related to grammatical and semantic features of 
the languages studied in this research. 

Study I: Verb Aspect Marking in Speech and Gesture 

First, I will describe the aspectual distinctions that are the focus of 
this study and then go over how Chinese and English differ with respect to 
aspect marking. 

While verb tense indexes the location of events in time, past, 
present or future, verb aspect indexes a view of events in time. Linguistic 
research (Comrie, 1981a; Binnick, 1991) indicates that the most 
fundamental of aspectual distinctions exhibited across all languages that do 
mark aspect is that between the perfective and imperfective. The 
perfective is described as taking an external viewpoint on events; the 
imperfective an internal viewpoint. The present study examines this 
distinction in speech and gesture as well as a further distinction between 
two imperfective aspects: progressive and durative. Although the latter 
terms are sometimes used interchangeably, I will follow Binnick (1991) in 
characterizing the progressive as momentary and the durative as having 
temporal extent. This contrast in imperfective aspects is captured in the 
following two English sentences: 

The aimel is coming. (progressive) 
The aimet comes ever nearer, (durative) 
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The timeline below lays out the distinction between the perfective 
and the two imperfective aspects. Progressive aspect situates the speaker at 
a point in the middle of an action in progress and durative lends the sense 
of an action having some duration, while perfective aspect situates the 
speaker outside the event, looking on it as a complete entity. 

I { 1 } I > 

1 begin to look I am looking (PROG) I slop looking I looked (PERF) 

< while looking ... (DUR) .- > 

As mentioned above, the marking of these three aspects in Chinese is 
accomplished through a set of three single-syllable morphemes. 
Progressive ZAI comes directly before the verb: 

Ta ZAI pao. 
He PROG run. 
He is running. 

Durative -ZHE and perfective -LE are both post-verbal morphemes. -ZHE 
immediately follows the verb: 

Ta bao-ZHE niao-long pao. 
He carry DUR birdcage run. 
He runs carrying the bird cage. 

LE may either come right after the verb or be placed at the end of the 
utterance: 

Ta jiu pacvLE. 
He then runs PERF. 
Then he runs ofl". 

The progressive in English is grammaticized similarly to Chinese, 
using the auxiliary "to be" and verb-fmal "-ing" marker: 

He is running. 

Durative is less explicit and depends on periphrasis and the presence of 
certain lexical items. In the speech samples I studied, I took phrases and 
lexical items such as these to express durativity: 

1. "While I As 1 was..., When..., In the pr(x;css of'...l eating, I choked on 
some spinach." 



-185- 



19b 



2. "With the wife freaking out, the husband takes up with his sccretar\'." 



3. VERB + VERB-ing forms such as: run screaming, exit laughing 

4. Keep VERB-ing forms: "She keeps wondering when she'll finish her 
dissertation." 

5. Lexical: "skid", "rotate", "deteriorate", "roll" 

Perfective is difficult to pin down in English and c ften an "external 
view of events" is not unambiguously retrievable from tht level of 
linguistic form. Further, perfectivity is confounded with p'ist tense 
marking in English. For the purposes of this study I relial on verbs that I 
judged to have an inherent perfective semantic as well as verb+particle 
constructions where the particle added perfectivity to the semantic 
specification of the verb. Compare, for example: 

hit vs. pummel 

eat up vs. eat 

end vs. draw to a close 

fall into vs. fall 

Coding . Several elements of gesture form were coded to provide 
the basis for the analysis in Study I. The form of the gesture stroke was 
coded. Stroke phases were coded as being single or multi-stroke. A single 
stroke is one uninterrupted movement that follows a single trajectory. A 
multi- or "extendable" stroke is one consisting of some kind of repeating 
motion that has the potential to continue uninterrupted for any length of 
time. For this kind of stroke, there is no point at which the stroke 
necessarily must stop. 

The durations of gesture strokes were recorded. The presence and 
durations of all gestural holds were coded. 

In addition, several indices of gestural complexity were coded: 
whether the gesture was one- or two-handed; if two-handed, whether the 
hands were "mirror images" of each other or assumed contrasting 
positions. The presence or absence of the expression of path, trajectory 
and manner marking relating to the action being gestured was noted. The 
presence or absence of "item marking" in gestures was noted; that is, I 
asked the question: Were any features of the items or objects about which 
the speakei was speaking revealed in the configuration of the gesture? An 
example of an action gesture that is not item-marked would be the tracing 
of an imaginary path of motion with the tip of the index finger. Such a 
gesture could be marked for item if, instead of a point, a balled-up fist 
(representing a ball, for instance) were to trace the path of motion, instead. 
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Analysis. Since aspectual distinctions are more simply and clearly 
marked in Chinese than in English, the data from the Chinese subjects were 
analyzed first to determine whether any distinct gestural forms coincide 
with the use of these markers. The American subjects' gestures from 
analogous speech contexts were then compared to those of the Chinese. 

First, aspect marking in speech was examined to determine if 
subjects tended to prefer one aspectual viewpoint over another or if the 
stimuli differed in terms of the aspect marking they tended to elicit. 

Next, the spontaneous gestures that co-occured with aspect-marked 
speech were examined. To locate possible differences among the gestures 
accompanying different verb aspects, all of the utterances in the data set 
that contained spoken aspect in some form were extracted and a subset of 
these was analyzed. 

The subset to be analyzed was extracted according to the following 
procedure. First, all the aspect-marked utterances that were accompanied 
by a representational gesture of some sort were extracted. The 
accompanying gestures were examined and any that seemed to express 
some portion of the semantic content of the verb in the utterance were 
selected. This selection criterion excluded any gestures that were 
semantically related only to a thing or item being described in the utterance 
as opposed to the verb related to that thing or item. So for instance, if the 
subject said, "And the bird dropped a bowling ball down the drainpipe", 
the accompanying gesture might reference only features of the bowling ball 
and not the dropping. The latter sort of gestures were excluded. Only 
utterances accompanied by verb-related gestures were retained for the 
analysis in Study I. 

Based on this criterion, 20 perfectively marked utterance/gesture 
pairs and 20 imperfectively marked utterance/gesture pairs were extracted 
at random from the Chinese data, with the criterion of not taking more 
than five utterances of one type from any one subject. The set of 
imperfectively marked utterance/gesture pairs consisted of ten utterances 
marked with progressive ZAI and ten marked with durative -ZHE. 

To this set of 40 Chinese utterance/gesture pairs was added an 
analogous set from the English data set. A subset of all utterance/gesture 
pairs from the English data set was extracted according to the same criteria 
described above for the Chinese data. There is some difficulty, though, in 
judging the spoken aspect of some of the utterances in English, since, with 
the exception of the progressive, aspect is not unambiguously marked, as 
was outlined above. To extract appropriate samples, I relied on lexical 
semantics for the perfective and to some extent for the durative, although 
for the durative I was aided by key words and phrases that denote 
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durativiiy (e.g. "while", "as", and so on). Nevertheless, it is often difficult 
to judge aspectual content from linguistic fonr alone in English. 

Results. The speech data are examined first. Across the Chinese 
speakers, use of the -LE perfective aspect marker was more than twice as 
common as the two imperfective markers combined, and within the 
perfective, the durative -ZHE marker was somewhat more common than 
the progressive ZAI marker. However, some speakers showed a marked 
preference for framing their responses in one aspect over another. 

Interestingly, the choice of aspect marking in speech for both groups 
of speakers was found to be largely independent of the particular stimulus 
being described. This was particularly clear in the data from the vignettes, 
where a given vignette might elicit responses from different subjects 
framed in terms of either perfective or imperfective aspects. 

The gesture data were analyzed next. Analysis of the gestures in the 
extracted set of 80 Chinese and English utterance/gesture pairs shows that 
there were differences in timing and form of gestures related to aspect 
marking in the speech. 

First, there was a difference in gesture stroke durations between the 
perfectively and the imperfectively marked speech contexts. The gestures 
coinciding with perfective marking in speech were much shorter than those 
coinciding with the imperfective. 

Table lA. CHINESE 
Mean gesture stroke durations in msecs.: perfective/imperfective 

Perfective: mean = .22 std.dev. = .111 

Imperfective: mean = .707 sfd.dev. = .442 

(p < .0(X)1, significant) 



Table IB. ENGLISH 
Mean gesture stroke durations in msecs.: perfective/imperfective 

Perfective: mean = .275 sld.dev.= .149 

Imperfective: mean = .580 std.dev. = .502 

(p < .013, significant) 

Tables lA and IB show the mean stroke duration of gestures accompanying 
spoken perfective and imperfective aspects in Chinese and English. Within 
each language the difference in the mean values of stroke durations of 
gestures marking these aspectual distinctions was highly statistically 
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significant. A two-way analysis of variance comparing these stroke 
durations in the two languages yielded a significant effect for aspect (p < 
.0001) and no effect for language. 

Tables 2A and 2B show the mean gesture stroke durations for the 
gestures accompanying spoken progressive and durative, the two 
imperfective aspects. TWs difference in stroke duration was not 
statistically significant for either language. 

Table 2A. CHINESE 
Mean gesture stroke durations in msecs.: progressive/durative 

Progressive: mean = .819 std.dev. = .497 

Durative: mean = .596 std.dev. = .372 

(p < .2706, not significant) 

Table 2B. ENGLISH 
Mean gesture stroke durations in msecs.: progressive/durati\ e 

Progressive: mean = .679 std.dev. = .590 

Durative: mean = .480 std.dev. = .401 

(p < .3899, not signif icant) 

Note that the difference in gesture duration between the perfective 
and imperfective speech contexts in Chinese cannot be explained by 
differences in the durations of the accompanying spoken utterances. Since 
all three of the Chinese aspect markers studied are single-syllable 
morphemes accompanying the verb, they all require approximately the 
same amount of time to utter. As for the English, if duration of the 
accompanying spoken utterance were a factor in stroke duration, we might 
expect that durative, which is often periphrastic, would take the longest 
time to utter and generate longer stroke durations as a result. In fact, 
stroke durations for gestures accompanying duratively marked speech in 
English are shorter on average than for the progressive, just as in the case 
of Chinese. 

These results, then, point to a quantitative difference in gesture 
forms. More striking {perhaps than the differences in stroke duration are 
the qualitative differences in gesture forms that accompany spoken aspect. 
All of the 40 gestures accompany iiig spoken perfective aspect in Chinese 
and. English used in this analysis were single stroke gestures. The gestures 
typically were very simple in form. More often than not they marked only 
the trajectory of some falling or flying object. In contrast, 85% of 
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gestures accompanying spoken progressive aspect were multi-stroke or 
"extendable" gestures. These gestures are often marked for manner and 
path of movement. For example, one subject's utterance in response to a 
vignette, 

A ml an is rolling across the screen ] 

is acompanied by a gesture in which the forearm, representing the body of 
a toy man, loops across the subject's gesture space in a path and manner 
similar to that of the figure seen in the vignette. 

Finally, as for the gestures that occurred in the presence of 
duratively marked speech, what was most notable was a sustained quality of 
the gesture, either in a lengthened stroke phase or in a long hold phase that 
might extend over more than one spoken clause. 

Table 3 provides a comprehensive look at the qualitative differences 
among the gestural forms that accompany the different spoken aspects. In 
this table, two different types of holds are distinguished. Here the holds 
that Kita (1991) identified are referred to as "syntactic holds", because 
these holds "wait for speech to catch up"; that is, the timing of their 
execution is dependent on the syntax of an utterance. Whereas the holds 
that occur with the duratively marked utterances are of a different kind. 
The holds in these gestures seem to have something to do with the semantic 
content of the verb itself and so they are referred to as "semantic holds". 
Such gestural holds are found to extend beyond any kind of distinct 
syntactic boundary and as such do not share- \he same timing relationship 
with utterance structure that Kita observed for the syntactic holds. 



Table 3A. CHINESE & ENGLISH 
Qualitative differences in gesture form in differing spoken aspect contexts. 



PERFECTIVE 
single stroke 
syntactic holds 

low complexity: 

1 event 
1 or 2-handcd 
mirror-hand gestures 
lecist manner marking 
brief trajectory marking 
little item maildng 



PROGRESSIVE 
multi-stroke 
syntactic holds 

medium complexity: 

1 event 
1 or 2-handed 
mirroi-hand gestures 
most manner marking 
path marking 
little item marking 



DURATIVE 



semantic holds 

high complexity: 

2 or more events 
2-handed 
contrast gestures 



item marking 
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The last feature that distinguishes the gestures coinciding with 
different aspect marking in speech is gestural complexity. I note that the 
gestures that accompanied the perfective tended to deal only with one 
event. They may be one or two-handed, but if two-handed, the hands 
typically are mirror images of one another; for example, as in two hands 
together suggesting the shape of a bowling ball. Perfective has the least 
manner marking of all the tyf)es of gesture. It tends to mark just trajectory 
rather than full path, and item marking is infrequent. 

The progressive gestures, as with the perfective, typically describe 
just one event and if both hands are used, the hands are mirror images. 
Progressive aspect marking in speech comes packaged together with the 
most manner and path marking in gesture. And again, as with perfective, 
there is overall little item marking. 

While the primary thing that distinguished the gestures 
accompanying duratively marked speech is the occurence of semantic 
holds, these gestures also exhibited a higher degree of complexity. They 
often depicted two or more events, which is understandable in light of the 
semantics of durativity: there often is a background action with a 
foregrounded action taking place concurrently. When there is durative 
marking in the accompanying speech, there is more of a tendency for two- 
handed gestures to occur than is true in the presence of the other two aspect 
markers, and when the gestures are two-handed, the hands are often 
contrasting, rather than mirror images of one another. That is, one hand 
will represent one function, the other hand another, and the two stand in 
relation to each other somehow. Further, item-markiii,; is very common in 
gestures that coincide with the durative in speech. Overall, there seems to 
be a more extensive and more cohesive use of gesture space in these 
gestures. 

Discussioa Conceptual representation of verb aspect during real- 
time language production appears unaffected by differences between 
languages in spoken aspect marking. The results presented here lend 
support to the thesis that, at least in some respects, conceptual 
representations, as indexed by gestural forms, develop independently of 
specific linguistic forms. 

Study II: Topic Prominence and Gesture : 

Within the spoken utterance, typological features of different 
languages such as standard word order and topic prominence affect the 
flow of information. Mandarin Chinese and English differ on the 
linguistic-typological dimension of topic prominence (Li and Thompson, 
1976). This is determined By ffie preponderance of certain types of 
utterance structures in the two langages. Study II presents another test of 
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the potential of using gesture analysis as a tool to gain access to the level of 
conceptual. representation. It is hypothesized that topic prominence 
features in Chinese, expressed in topic/comment and various object- 
fronting utterance structures, may focus the attention of native Chinese 
speakers on isolable elements at the level of conceptual representation, and 
on a consistent separation of elements from process rather than a 
unification of elements and process. 

English has a basic SVO utterance structure (Comrie, 1981b) and 
Chinese is usually also categorized as an SVO language. Li and Thompson 
(1981) maintain, however, that Chinese is better viewed as unclassifiable 
relative to this linguistic typological feature, since the subject/predicate 
relations on which such a categorization is based in other languages are 
such tenous factors in Chinese grammar. Speakers of the language though, 
certainly have access to this basic word order pattern. 

More significant is the fact that Chinese is a topic prominent 
language, with several commonly used stmctures that alter the ordering of 
reference and action. These include the object fronting BA construction, 
the resultative and directional verb compounds (RVC and DVC) with 
which it often occurs, and topic/comment structures. Of these, English 
makes available the topic/comment option as well, however it is not used 
with any frequency as it is in Chinese. By way of illustration, in all of the 
English-language data examined for this study, there was only one instance 
of a topic/comment structure. Following are some examples of utterance 
structures that were used in this analysis. In the first example, the BA 
construction moves the direct object into a position in front of the verb, in 
this case a DVC: 

Ta BA qiu diao-xia-qu. 
He OBJ ball threw-down-go 
He threw the ball down. 

In a topic/comment structure, any grammatical element of the 
sentence, or elements not grammatically related to the sentence may appear 
in utterance-initial position: 

Nei xiao niao, mao qiang-dao shou. 
That little bird, cat grabs-to-hand. 
Now that bird, the cat grabs him. 

Data . The analysis for Study II is based on a total of 120 
utterance/gesture pairs from the Chinese and English-speaking subjects. 
From the Chinese narratives, 60 single clause utterances that co-occurred 
with representational gestures were selected. Thirty of these were standard 
SVO utterances, and thirty were non-SVO utterances. Of the latter, sixteen 
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utterances contained BA-constructions, thirteen were topic/comment 
utterances, and two were other object fronting utterances. The selection 
was not balanced across subjects, although several utterance/gesture 
samples were taken from every subject. The goal was to find the samples 
of the target utterance structures that were accompanied by reliably 
codable representational gestures. 

Then, 60 single-clause utterance/gesture pairs were selected from the 
English narratives, controlling for equivalent or closely equivalent 
informational content to those selected from the Chinese narratives. All of 
these were SVO utterances, since, as mentioned above, there was only one 
non-SVO utterance in the corpus. 

Analysis. The gesture from each utterance/gesture pair was 
analyzed as belonging to one of these three categories: 

1 ) Item only gestures encoded only features of some item or 
object in the discourse context; for instance, the shape of a bird cage or the 
extent of a wire. 

2) Action only gestures encoded only features of an action; for 
instance, the trajectory, path or manner of a movement. 

3) Incorporating gestures encoded features of both items and 
actions in the discourse context. A portion of the gestures in this category 
were termed "total enactment" gestures since they encoded features of an 
item, action and character. The gestures included in this latter sub- 
category were those in which the subject actually pantomimed the action 
being described. 

Results. First, the overall number of gestures in both the English 
and Chinese utterances that encoded Item Only, Action Only, and Item and 
Action Incorporated were counted. Table 4 shows that the total number of 
gestures is greater than the number of utterances for both English and 
Chinese. This is simply because some of the utterances contained more 
than one representational gesture. Considering the numbers as a 
proportion of total gestures, it's clear that there are differences between the 
English and Chinese gesture production both for Item Only gestures, and 
Incorporated gestures. The largest difference is for the Item Only 
category, with only six gestures in the English sample of this category, 
compared to thirty-one gestures in the Chinese sample. 
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Tabic 4. GESTURE CONTENT: ENGLISH vs. CHINESE: All utterance types 





ITEM 
ONLY 


ACT 
ONLY 


ITEM & ACT TOTAL 
INCORPORATED GESTURES 


ENGLISH 


6 


16 


43 65 


PERCENT 


9% 


25% 


66% 


CHINESE 


31 


12 


30 73 


PERCENT 


42% 


16% 


41% 


Table 5 compares the two Chinese sentence types studied: SVO and 
non-SVO. It appears that there is no effect of sentence type on feature 
encoding in gesture. Item Only gestures seem just as likely to occur in 
SVO sentences as non-SVO sentences. 


Tables. GESTURE CONTENT: 


CHINESE SVO vs. NON-SVO 




ITEM 
ONLY 


ACT 
ONLY 


ITEM & ACT TOTAL 
INCORPORATED GESTURES 


SVO 


15 


4 


13 32 


PERCENT 


47% 


13% 


41% 


non-SVO 


16 


8 


17 41 


PERCENT 


39% 


20% 


41% 



Table 6 is based on all 120 utterance/gesture pairs and breaks down 
the data on just the incorporated gestures into three categories reflecting 
encoding of features of the different syntactic categories: gestures that 
incorporate features of the direct object of the verb, gestures that 
incorporate features of a subject, and those that incorporate features of 
both subject and object. Though the numbers are small, here the Chinese 
show a stronger tendency than Americans to selectively incorporate object 
features in their action gestures. Notice also that Americans are more than 
two times as likely to produce total enactment gestures, the whole-body 
pantomimics that are the most extreme form of incorporation. 
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TABLE 6. INCORPORATED GESTURES ONLY: ENGLISH vs. CHINESE 



OBJECT OBJECT SUBJ.& OBJ. TOTAL 

+ ACTION + ACTION + ACTION GESTURES 



ENGLISH 3 12 28 43 

PERCENT 7% 28% 65% 

CHINESE 9 10 11 30 

PERCENT 30% 33% 37% 



The final table. Table 7, displays the same counts as Table 6, but 
comparing across the two categories of Chinese sentence. These are the 
snnallest numbers yet, since the Chinese speakers didn't produced that many 
Incorporated gestures. Here the patterns of feature encoding in the SVO 
and non-SVO sentences are nndrror images of one another. The action 
gestures accompanying SVO syntax are more likely to incorporate subject 
features, whereas the action gestures accompanying non-SVO sentences are 
more likely to incorporate object features. 



Table 7. INCORPORATED GESTURES ONLY: CHINESE: SVO vs. non-SVO 

OBJECT SUBJECT SUBJ.& OBJ. TOTAL. 
+ ACTION + ACTION + ACTION 

SVO 1 8 4 13 

PERCENT 8% 62% 31% 

non-SVO 8 2 7 17 

PERCENT 47%^ 12% 41% 

Discussioa The results reported above are generally consistent with 
the hypothesis that the topic prominence feature of Chinese m?.y increase 
the salience of topics and objects of actions for speakers of that language, 
causing them to produce Item Only gestures that encode features of 
syntactic objects and utterance topics in higher proportion than English 
speakers. Interestingly, as the results displayed in Table 5 comparing SVO 
and non-SVO utterance structures indicate, this pattern of gesture appears 
not to be conditioned sentential syntax, since Item Only gestures are no 
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more likely in the context of non-SVO utterances. This suggests that these 
gestures reveal a general patterning on the level of conceptual 
representation rather than a pattern of syntactic thinking emerging in the 
gesture channel during speaking under the control of the linguistic code. 

Representational gestures of the sort defined here provide clues to 
how speakers sort out and organize the domain of referents during real- 
time speech production, ^hese and other observations suggest that some 
features of linguistic structure do emerge in gesture. 

General Discussion . 

Cross-language research into these phenomena provides a means of 
determining the extent to which conceptualization is related to linguistic 
representation in such areas of grammatical difference between languages. 
Since languages differ in how readily they encode certain features of a 
conceptual representation, spoken utterances in different languages will 
differ in the features they select or highlight. The gestural component of a 
speech-gesture unit, however, reveals a view of conceptualization that has 
not been filtered by these linguistic forms. 

The research presented in Study I lends support to the thesis that, in 
some respects, conceptual representations develop independently of specific 
linguistic forms in real-time speech production. However Study 11 
revealed what appears to be an area where features of conceptual 
representations do correlate with differences ii^ liagujstic form. The 
Chinese speakers show a preponderance of item-marking gestures over 
action-marking gestures relative to the English speakers, suggesting a 
privileged role of isolable entities in the conceptual representations of the 
Chinese. 

There may therefore be areas of conceptual representation with 
which linguistic representation does not interfere, as appears to be the case 
case with aspectual distinctions, as outlined above. These may be cognitive 
universals. There may exist other areas, however, where linguistic forms 
do influence developing conceptual representations, at least in the context 
of on-line speech production. The analytic study of gesture is a 
methodology that may profitably be used to examine this kind of "thinking 
for speaking." 
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® lit -^im^ 6^ ^ M 4 # ^i^tai 

( ® X iiisn ^ ^ - ) 

^ -i:! 3^ 



» (?P;S:^ base form) » ^:|^;^-fl:7^&6^ 

(?P^;T:t^ derived form) Mt;ll«^tf tb#5c^5£B#^^^X® 

+ (_L^-^ 'sjid) > ^tf 'chi < 'sji, shi < 

*sji Se < *sij » |5] tS] ^ ;f rf: , ^ i|i ^ ^ _|- 1| ^/j ^fj^- sji 

fif sji 2.10 - - ?i > 

'fr^llt^'f (Nevsky 1960: I -418) 

';^'<l;|^i.'^ it^^S (Nevsky 1960:1 -405) 
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m sji 1.30 ^^-^j 

fnt- (Nevsky 1960: I -387) 

^ Nevsky ^(]&^m^R^f^i&J^^*^-'i®^ ( f ^ ) th^lLi^^^J^I^^^'tC 

^'h-'O^J^ ' ©iff t #j|^tf ""^f J (Jii^-I- *sjin) > a.tf 
shing <'sjing ^m^U sac < 'sik ^ j PlM^t^^ ♦ 

II Sji 1.11 ^^^m-^i-ihj 

lit ill 



iXt U > m% (Nevsky 1960: I -478) 

g sji 1.30 r^^j 

U ITl (Nevsky 1960: 1 -487) 

\dti^^^^^ ^Hk ( (^^> 39.123) 
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nM- ' @A ^Ol- sji # 1$ sji ' ^ sji ^ 

^^ + — ;5L±^g+^ (^x2.104^) ^x;5Li5<g-f^g^+- 



®S:i§-|-f8##6^jJl|- » 4;(tig^^:^ii^x(Gong 1988, 1989, ^ 1993) 



2.1 ;fe®j:-Oi^t ' ^/Vf^^^+WfM^y,^+Egf^^^Eg+^^r«^til 

(*t^+-i8#^ =-+f^^r«^^jc#-3EC^^.J) » 
+ — i^vx^^ =.+:r^-g||#^^-i-^f^^:^;fgf5](Gong- 1989:20-26) o 



(*t^ =- + f^^^ + ' Gong 1993:947- 

952 ) ' ^ffl A > B > C'^^W^^t^m ° 



R.ll [1.11-2.10] -ji — 
^If sji 
ic sji 



R.31 [1.30-2.28] -ji 

ff'i' sji ^?e.j 

g sji 
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Tt'^'^-k^mmiiGoug 1994) » U$,&XMtm^'\m(J^^Ai^- - ^ - ^ 

A B C 

MTL-g— ^« R.8 [1.8-2.7] e R.34 [1.33-2.30] ej 

M7L-g--^<fi R.9 [1.9-2.8] ie R.35 [1.34-2.31] iej 

M7L-g-=.^<8 R.IO [1.10-2.9] ji R.30 [1.29-2.27] ji R.36 [1.35-2.32] jij 

R.U [1.11-2.10] ji R.31 [1.30-2.28] ji R.37 [1.36-2.33] jij 

*7L-g— f« R.12 [1.12-2.11] ee R.38 [1.37-2.34] eej 

-^7L-g--flS R.13 [1.13] iee R.39 [1.38] ieej 

-fe7C-g-=.^<^ R.14 [1.14-2.12] jii R.40 [1.39-2.35] jiij 

;^x5I-=-tr/f^6^jR.inif sji > ft s j i ""^ > l^j -R.Sl fn^sji 

4^° 



4'feB A - BC6^j<^:;^ ° 

3.1 MTL-t— ^^8^f-U<J#^ 

^#?'J6<)<^:>^#>-Hij » ^iaR.8^R.34ir^Jt.6<jBm^m4R.28 [ 1.27-2.25 ] 

3° ^f}kT-^rf\^^'^.2%^ims.^^i^mmf' ° 
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a) R.28 # R.8 

R.28 lha 1.27 *4'(S.4764)' 
R.8 Ul Ihe 1.8 *.*g(S.5551) 

iXm (W.2856, W.2857)' Iha-lhe ^^i^^ ^ ^Jk^ ^'^mj ' 

^ik {y^-:m%} m 12.141;s.i^ 12.142° 

R.28 iit Ida 2.25 4il^(S.3408) in^^} (49B6);iA i^k Ide 
R.8 t4 Ide 2.7 4a^(S.4873) <|S]-|-> (52B3);i;fe- 'ifii Ida 
(^;^> (W.1754, W.2149) Ida-lde ^^i^ffl ^ ^^m- 

(R.28);S.^ ' A(R.8);S.^ o ^:5^|^j^.iia , ^.JP^T <l51-g-> <X)^> 
it^x^t^it*f (Nevsky 1960:11-60 ' <ISS*i^> 346-4) mn^-^^ 

b) R.28 # R.34 

R.28 ^4 YW9 1.27 Pi(S.3635) <|S]^> (42B4);iA & ywej 

R.34 £t Ywej 1.33 Pi - -f - 3?!^(S.1964) <|S]^> (42A4);i:^ ^ ywa 
<x;^> (W.938) YW9-Ywej » <^tJ^> (295) Pif- o 

R.28 i| S9 2.25 j-t- if - ;t(S.4149) cf. seng-po clean 
R.34 sej 1.33 ;f - ;f (S.1855) bseng-po 

R.28 in pa 1.27 :^(S.3941) 
R.34 k% pej 1.33 *^(S.3476) 

(^;^> (W.870) P9 1=^ 4 pt] {^) o 

(R.28)^C(R.34)i^;^ ' 3.B^^C^^k ' - £.^^imX$kXit)^^^m 



3 Sofronov(1968)^*.i*«i ° 
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3.2 ^a.7t^^^m^mi}^^ 

i^mn^^>^Jfi^-M ' l^R.9;5LR.35j^Ji6<3BII8iJMm;^R.29 [ 1.28-2.26 ] 
i9o ^^;&R.29tafJ'%.<ii-m.9#^6<j^-J-^ ' vX;5L=-.^#R.35#^6<j^'Jf-° 

a) R.29 ^ R.9 

R.29 m ki9 1.28 "H'fCS.OlSS) m^} (24Bl);ii 6^fj kie 
R.9 ^ kie 1.9 ''H-'f (S.0560) <I§l-|-> (24Bl);i;& r^R kis 

<x>^> (W.286, W.943) kis-kie ' *4^lb ^"H-f j ° 

5L^^ Nevsky (1960: I -548);5L Kepping (1979:429, No. 734) o 

R.29 1# khiw9 1.28 * WiJ-5t(S.2182) <I§l-|-> (25B4);ii |# khiwe 
R.9 1^ khiwe 1.9 ' «-J -5^(8.2397 ) <|51-|-> (28A2);i;& khiwa 

<x>§:> (W.299, W.961) khiw9-khiwe ^^i^ffl ' ^'S'J^j^j ^' 
' *P;^B(R.29);&Bt ' A(R.9)^<|L°R.29-R.9itffl6^is] 

R.29? M'dziwa? 1.28? (t) (S.5336) <Is]-g-> (41A3)5iiL ^Mziwe 
R.9 {i^ dziwe 1.9 (t) (S.3853) <iS]-|-> (39Bl);i;& mdiiwa 
<^>^> (W.2764) dziw9-diiwe J^^ik^l im^} (291-5)^^ 

b) R.29 ^ R.35 

R.29 dzi9 2.26 i€(S.5338) <Is]-|-> (34A3);iiL It dziej 
R.35 It 'dziej 2.31 2^ > *Sr(S.0468) <r^-|-> (40B7);i;& ^Mzi^ 
<^>^> (W.3002) dzi9-dziej -r^^i^ffl ' <tt^> (371) jr*# 

^^mj ' <^=M^> (518-4) ir^ff ^m^^j » x®w (©Xx^^ 

M> (072-051)*^^ ° 
R.29 1^ ki9 2.26 ? (S.2096) <iS]-g-> (24Bl);iiL ft kiej 
R.35 ft kiej 2.31 ? (S.1780) (l§l-|-> (21B5);i;& ft kia 
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Hh^i^y^m ^Ji*j ♦ iiLj*-&*1.34*iliL4<j2.31 ° 
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^ (l^-|-> ;±^^o kia-kiej — ^i^JD » Nevsky I -307) 

(W.3785, W.3940) ^ib^-ft ^ejnye pozvonki ° 

R.29 111 1.28 *51 ^ ^(S.5424) <la)-|-> (35B6);±^ /lt sio 

R.35 it ^iej 1.34 '51 ^ 4¥^(S.1836) (laj-f) (38B6);±^ /fi §io 

B(R.29);&|^C(R.35);fe^l:o 



3.3 MTL^^f ^^f«^6<j## 

it-l^6<j#^^'J-1^i:^ ' (l^lLt 1993:959-965 ) » 

a) R.30, R.31 # R.IO, R.ll 

R.31 g$ bji 1.30 Mg.-T(S.050P (isj-f) (5A5);±^ bji 
R.ll ^1,] bji 2.10 T(S.3591) <is]^> (2A6);±;& 4 bji 
<X)^> (W.1028) - ^^;^I5]4^15] , jL-zi^i^JD bji-bji» 

R.31 mji 2.28 .!!>i(S.2544) 
R.n M mji 2.10 fk{^.376Q) 

R.31 M kwji 1.30 .^^(S.5707) {n^} (25B2);±^ kwji 
R.ll i.% kwji 1.11 .^^(S.5746) <l5]-i-> (24B6);±;fe- M kwji 
<^;^> (W.416, W.1095) kwji-kwji , o 

R.31 it; Ihji 1.30 *#-;f^^^(S.3347) <ls]-|-> (48A5);±^ f|l Ibji 



R.ll fSl Ihji 2.10 -$^#-;f^^^(S.5001) <|i]^> (47A2);±;& Ihjj 
<5:)^> (W.1083, W.3026) Ihji-lhji i^ifl » t$, ^m^m^^-M 

•Mj °ifi^t#t ' g^ihji ^x^t ^11 Ihji » n-^ir^i^o 
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R.31 it Ihji 1.30 <I51-|-> (48A4);iiL Ihji 

R.ll 1^ Ihji 2.10 A^(S.0406) m^} (47A2);iT Hii phji-lhji 
<x;^> (W.1082, W.3030) phji-lhji-lhji ^ ^ ° 

R.30 sji 1.29 ^^^^tt(S.1052) {n-^} (38Bl);iiL S sji 
R.IO sji 2.9 fJ ^tt(S.102G) <I5]-^> (35A6);i;& «t sji 

<x.f^> (W.985) j'X-^^lbl^^^-l ' (I^^> Ji^lK^ sji-sji 

R.30 i^fi^ iji 2.27 ? (S.2208) <I5]-f-> (48B5);iiL iKt Iji 
R.IO Int Iji 2.9 ? (S.3356) <I5]^> (49A7);i.;& ^ Iji 

R.31 kji 1.30 •i^^(S.5038) <I5]^> (25A5);iiL ft kji 
R.IO fie kji 1.10 -1^^(5.4134) (25B3);i;fr kji 

<5:}^> (W.313, W.1055) kji-kji i^ffl » t^$, ""i^^j ° 

b) R.30, R.31 ^ R.36, R.37 

R.31 bji 1.30 ^ J^(S.0494) <I5]-|-> (5A4);iiL II bjij 
R.37 H. bjij 2.33 i^(S.1309) <I51^> (2B4);ii so 

<5:>^> (W.1027) bji-bjij i^ffl ' Jij 4^ ° <I^-t-> 

R.31 t'l tji 1.30 -^^(S.2704) {^^} (18A7);iiL i tjij 
R.37 tjij 1.36 ^t^^(S.2071) <I5]-^> (15B3);i;& t>l tji 

<5:>§:> (W.1032) tji-tjij Ji-^ii^ ' iJk'^mm ""^ ^-fa^tj ° 

R.31 ^nt dji 1.30 •^^(S.4116) <I5]-g-> (13Bl);iiL djij 
R.37 ll- djij 1.36 •^1^(8.4546) <Is]-g-> (14A2);i;6r Op^^- dji 

<^>^> (W.1047) dji-djij -^i^ffl »^i^ls, ^^^j ° 

R.31 nji 1.30 #^*^(S.2959) 
R.31 I'^i nji 1.30 •^E(S.5081) 
R.37 njij 1.36 il ^ iQiiS.mb) 

<^;*> (W.1041) ^X*^- - ^^j^ 1511^15] o 

R.31 iTi khji .1.30 M^>f^(S.1878> {n^} (24B4);iiL & khjij 
R.37 it khjij 1.36 MI.^^(S.2083) <I5]-g-> (22Al);i;& /ft khji 
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ixm (W.1058) khji-khjij ^^i^^ » ^^^^^f-Mj ° 

R.31 Mi tsjwi 1.30 '^i-^CS.ieOS) in-^y {3m)iiAtk tsjwij 
R.37 d tsjwij 1.36 •4I1>#(S.4928) {n-^} (34B4);i;^ tsjwi 
<x>^> (W.1104, W.1116) tsjwi-tsjwij -^i^ffl » 

R.31 It sji 2.28 ii^-^a(S.2058) <I^-g-> (30A6);iA ^ sjij 
R.37 Ifi sjij 2.33 iJit -lt(S. 1272) <l5]-g-> (31B5);i;^ ^ sji 
Xikf^ sji-sjij -^xt^ (Nevsky 1960: 1 -305, 306) fm 

R.30 ^ dzji 1.29 *«a(S.2540) <Is]-g-> (39B3);iA ^ dzjij 
R.36 ^ dzjij 1.35 ^-'1^(8.4909) 

(^>^> (W.2705, W.2720) d^ji-dijij -^i^^ o 

R.30 ^1^ Iji 1.29 ^(S.5675) <ls]-f-> (48B5);iA Ijij 
R.37 1^ Ijij 2.33 ^(S.4815) <is]^> (52A5);i;^ ^ll Iji 

1 ji-ljij ' ""^^^j ( it^^y 094) » 

rg ( 241-3, 429-6) o 

3.4 -kTL-^-^^^rm^m 

m^^u^j^j^m^i ' m.i2AR.3snMBmiii)^A^.32 ci.31] 

R.32 H d99 1.31 •S'^k(S.0847) 
R.12 gj dee 2.11 i^^SAbbS) 



3.5 -kTt^^^m^mm^ 



3.6 ■kTL-^^.^n^mm^^ 

Cl. 32-2. 29] jiioR.33#R.14^rH^ 
^#3£yL,fel^;*t5] , ^^ty^tkf^^yj^*^^i^}n^ij^^£.^ ' 4fP;lB(R.33) 
' A(R.14);fc^|. ' : 
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a) R.33 # R.14 

R.33 ffti phjii 1.32 •>f^it(S.5249) in^} (9A6);i^ i phjii 
R.14 g phjii 2.12 Mtit(S.1038) <IS]^> (8A2);i;^ ffJ! phjii 

■<X5^> (W.1161) phjii-phjii -z-^i^ffl » :t;^ r^f^itj 

R.33 ii bjii 1.32 M| > ^(S.4779) 
R.14 Vil bjii 2.12 3 >^(S.5399) 

<^5^> (W.1163) ^-X^^Al^^^^ 

R.33 ^.ft djii 2.29 5f > 5tL(S.4168) in^} (16A7);iA Iftdjii 
R.14 '.tC djii 2.12 * m > >E-(S^4169) <I5]-|-> (15B2);i;& ^.^c djii 

in^ty i^m^-=^^'4:MM, djii-djii° 

R.33 it dzjii 2.29 f > 4-(S.4867) <IS]^> (33B4);iA H dzjiij 
R.i4 ^<Jt dzjii 2.12 •#>^(S.0561) <Isi-|-> (32B7);iA dzjiij 
{y^m (W.2930) dzjii-dzjii i^ffi j « 

R.33 tsjii 2.29 Sit(S.2983) <Isi-|-> (39A7);iiL n!^ t§jii 
R.14 fillip tsjii 2.12 3k{S.2%0) {^^} (Sofronov 1968:11-210) 

ficfi tsjii ^ 

nnfmn^ t^jii-tsjii i^m » t^j^ ^m^^j ° 

R.33 U sjii 1.32 l€(S.0228) <Is]-|-> (41A3);iiL f'lr sjii 
R.14 sjii 1.14 l€(S.4072) <IS]^> (39B2);i;fe- ^(f njiij 

<X}^> (W.453);S.ir^#t3^3f njiij-sjii-sjii » 
r (.c) t^fe'l^j ( <^*i^> 399-1 ) o 

R.33 CI. Ijii 2.29 '#(8.4712) 

R.14 Ijii 2.12 #(S.2265) 

R.33 ^1 Ihjii 2.29 *'f§:(S.0319) {n^} (51B2);iA Ihjii 

R.14 Ihjii 1.14 't*>iI(S.5375) <I5]^> (54B3)ji;^ M Ihjii 

<^5i^> (W.2827) Ihjii-lhjii si-^ik^l » r?^'t*j « 
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^^1.146<j ?s -phjii -2.126<j lit phjii^x;SLl.lU^) Rll phji*P;tl5]-ij5.i* ° 
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b) R.33 # R.40 

R.S-^ '4: djii 2.29 '^(S.0496) <la]-|-> (16A7);iiL il djiij 
R.40 il djiij 1.39 5^(8.1173) in^} (19A2);i;& 6^ djii 

(W.1352) djii-djiij^^:?:itffl » ^^^^Aa ° ^^^T 
^^ffl ^ (li*<^> 197-5, 210-5. 300-6 o 

R.33 dzjii 2.29 f > 4^(8.4867) <|5]-|-> (33B4);±i dzjiij 
R.40 dzjiij 2.35 > ^(8.4859) 

(W.2916. W.2934) dzjii-dzjiij s-^i^j^ » ^^f j ° 

R.33 me sjwii 1.32 ^(8.3461) ([i]-|-> (33B6);±i ^ft sjiij 
R.40 ifc sjiij 2.35 >m-rt(8. 5662) <|i]-|-> (29Al);±;&<fic sjwii 
(W.1185) sjwii-sjiij s-^ik^ » ^1^;^ ^^.^tj ° 
3C Nevslty (1960: II -29) iT^i? ""il.j o 

R.33 II Ijii 2.29 #-^'#(8.4712) 
R.40 ff^ Ijiij 2.35 #(8.2266) 
R.40 Ijiij 1.39 '^^'#(8.3497) 

{y^m (W.1368) Ijii-ljiij ^^i^ffl ' ^^^.^ o 



^f] ^1>{^ bji-bji ""^j #BC^6^ 1^^ bji-bjij ""j^j {m^} 
<|g*<^> (346-6) 

^'it <i til ^ vk M i^i^ '4 ^J^^ H 

^ t '<2 X X ^ A T T :^ ^ pre. ^ 
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a) ^BA^^ij bji-bji ^-pj t ' ^-m^^r? (Bm) bji > ^. 
Nevsky 6^^j^(1960: I -573)^;1^1al^,j , ^^^g^^^, ^$ bji (AIM) I'J^ 
T?'J^'f®l5]^'J(Nevsky 1960:11-72) : 

^®5:^t <x>$:> ^Bm^ij bji *iH3Eii/^: ' ^/^:#;mAm6^ 
^$ bji ^i^ffi (-io: bji-bji ) ' A^j^J 

"f-^Am^^ '4 bji I"J^iH5^.v<jt > ^tw/^:;^J:.®f;fljL6^ bji-bji i% 

10.171 firUi^ ^^T 
34.271 

37.252 T(^^^) 

37.251, 38.221 (^^^ ) 

38.241 ^i^,^ )ra$(R «T*A 

^1^;!: Nevsky ^il® ©^/f^ 6^x4^15] , ^^^^^t <XJ^> i^^^j^J^fp 
-l5:ii!L§S* ' Bm^^ bji ^T^S^^^iH^i. o tK^^Am^^ '4 bji i^ffl ' ^ 
° ' Am6H$ bji I'J;r^^aT^$?ib3Ei ' -<!L^#6i^ bji 

b) >-XT4^1#;t*BC^ bji-bjij o 

Bm^ij m bji Nevsky 6<j^J|tat^S'Jls]^'J(1960: I -569) « Cm 
6^ H bjij ^I'JtT® =.^1111^1^'] {X^lt 11-354) : 

<X)«:> ' BMiii} bji <Xi*> ^A^^^v^ > *i5;|:|ia 

C^6^ H bjij (-in bji-bjij ^Xj ) ' ^-'^'J « C *S 6^ 
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bjij <X)^> t*iiJ5£-=^ + ^^ » ^^^\>t^?^X^Flf1l \i bji- 



10.132, 37.252, 78.121 






23.121 








23.141 






TJi 


27.111 




lilt ^ k 


( M ) 


32.272, 36.233, 57.241 




-t* 


52.141 






-^t ^ 


60.121 








67.211 






Ml. 


82.172, 84.141, 4 


% 3.221 If k 


%^ 


# 7.132 








# 10.141 






) 



iijJA ' A-n & (free form) » .fii B Kt^mPftiA^CI^f^^ib 

5^' ""Pft^tf^j (bound form) « 



4.2 ®i:tf 6<3-|-^#^ » ^^hit*-=tt;l:#!^7L-|-^f«16^#^°i®j:tf-0 
i^t ' %-mB.%7^^^A$ii>^ Sofronov ^;f^6^j;J^^|[Jt«i » Si5. 
$-t + ^-Si>g#|^-'Jv^5tfM(Sofronov 1668:137) o ;j^^^^|a.$-,js#jt 

1993:952) 



A R.IO [1.10-2.9] ji 
R.ll [1.11-2.10] ji 

Bm B R.30 [ 1.29-2.27] ji 
R.31 [ 1.30-2.28] ji 

cm C R.36 [1.35-2.32] jij 
R.37 [1.36-2.33] jij 



% 7t ^ 
Ai R.70 [ 1.67-2.60] ji 

Bi R.72 [1.69-2.61] ji 

Ci R.64 [1.61-2.54] jij 



9 & Gong (1988:820) +i^teR.31#R.64;JaiS& ' ISfliif li^^ifi^jlt^^ ° 5£ 
(Gong 1989vX;S.ft 1993) e.i$[-iaR.64ji|.iS^R.36;?lR.37;Jaa& ° 
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^x^^^^^m$-J^-i?miik^^B AA^^BC^^ ABC ^mmm 
m^Tt^fA^^^^f-^^' ^M^n.^BA^ABC^m^i'^ » p-^tAm^cm 

^ #^ 7t-i-^r«i A AC mi^n-^ym ^^b mnn n ^mi^ ac 

T : 

# 7L ^ t 7t ^ 

Al^ A bji2 >fg,, T(2.11) > A, bj i^t^g. M^Td .67) 

i 

BI5§ B icifi bjil ' T(1.30) 

Bi^ B bji^ _L(1.30) 
T 

cm C H bjij2 ^ » _L(2.33) > d )tt bjij^ ^<'ft.^ d .61) 



4.3 ik^^^'i-s^^m^m^fAM.^^^ (AmAcm) ^^fi.^ {Bm) 

©w(1964)|^ Sofronov(1968)-^ta.j^R.8^R.14 J^tstr 
Tt-I-' .?iiR.285.R.33 ^-J*t:^7t-|- <> ^^Cli^# (R.34 

^R.40) Sofronov ' ^^iki&$,t:i!']^^t7L^'' 





®w(1964) 


Sofronov(1968) 




Am^ 








8 (1.8-2.7) 


-I 


e 


e 


9 (1.9-2.8) , 


V 

-le 


1 


ie 


10 (1.10-2.9) 


- i 


ie 

A 


ji 


11 (1.11-2.10) 


-ifi 


i 


ji 


12 (1.12-2.11) 


-«ifi 


e 


ee 


13 (1.13) 


-"ifi 


1 


iee 


14 (1.14-2.12) 


-ifi 


ie 


jii 
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28 (1.27-2.25) 




3 


a 


29 (1.28-2.26) 




A 

g 


i 3 


30 (1.29-2.27) 


-ifi 


jg 

A 


ii 
J* 


31 (1.30-2.28) 


-i 


I 


ji 


32 (1.31) 




3 




33 (1.32-2.29) 


-\ 


is 


jii 

•i 










34 (1.33-2.30) 


-e 


e i 


p i 
c J 


35 (1.34-2.31) 


V 

- le 




i P i 

i C J 


36 (1.35-2.32) 


-efi 


iei 

A 


i i i 










37 (1 36-2 33) 




Tn 


j i j 


38 (1.37-2.34) 






O O 1 

ee J 


39 (1.38) 


-ey(?) 


ai 


ieej 


40 (1.39-2.35) 


-lefi 


ie 

A 


jiij 



^^Ji^^'X^-ih^^iLf/fiMS^A^fi^^B't^j^Swfi^^ =.^^^;5. Sofronov 6^ 
e l?^7L^ o ;^x6^Bl^7L-|-4a't^^S®6^$i#^ Sofronov 6^ 9 i^7l-|- o 

^R.43) o 

^x6^»J;(5t«^^«i-f-7L^6^iit±:g-^TtS®|ia Sofronov » -fs 

;l:^^^^^f*]^^6»;iJM'J;f^^4^X^6^^5£(Gong 1994) » ^efi#^^^=.^ 
(Sofronov MiJ^^ii^ug^) » TL-g-^'^-^M (Sofronov -^^^^^im.) ° ^f^^m^ 

Gong 1989:28-32) » ^'J C->B 6^ ^^bP^ T ^^7t-|-6fj^^^b^h » it;&o_Llg4-6^ 

A-»B e > 9 ee > 99 

ie > i9 iee > i99 ( ;1^#:#J ) 

ji > ji jii > jii 
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C^B ej > 9 
iej > i9 
jij > ji 



eej > 99 

ieej > i99 ( ) 
jiij > ji 



4.4 i^T^^in-si^m^^ ' ^BAAABCAii^n^Bmm-^A^^ij^^^ » 

AA >BA 

CC ^BC 

sar-sar .f^ (f^^ PnL ( 4^f^A#^i^|L;r^^ ) 

^ A # ^ ^ e 4^ 0^.^> 25A-4) 

bow -bow li: fj£ n'lt niL (^^^-ifp^) 

m -in 4 iU^} 6B-2) 

bio-bio ^1 .'t #1 ( ^i^ jL jL:^;?^ ) 

7^ SL SL ±. ^ im^} 11-7) 

thju-thju r,?i fTt 51 f'.k i^^tt^f^) 

^ t t ± ^ ^ 0^.^> 12A-1) 

rjur-rjur ,5 i| (^i^ti^it) 

It it t (HI? <-?l.^> 8B-7b) 
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YU-YU 5l;4-frtfl (^^Pl-*^) 

^ 3. El P,| iXn im'f-} 22A-la) 

BA^ Ihji-lhji -I i?- a U %^fm^ 

% % pref. m iXn imf-} UA-la) 

AA^ Ihji-lhji fj^ ft( ik 

-Ao % % <-?#-^> 6B-3) 



1. 


dju-dju 


7? 






2. 


dzji j-diji j 


iti 'A 




— > rH'^ ^ —^7i 


3. 


sja-sja 








4. 


Iji j-1 ji j 




ILH 




5. 


wji-wji 


II ^ ■■ ^ 






6 


rjir-rjir 








7 


dzji i j-dzji i j 










BA- CAA^n':i:FfT^iim^ 




8 


. l9-le 








9 


. YW3-Ywej 






n-f 


10 


. kia-kie 








11 


. dzia-dziej 
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12. kji-kji 

13. tsjwi-tsjwij 

14. sji-sjij 

El f :^ ' io^$#,?fiif ^^-F (inJi®^ij8jL^-J13> 

ifc •••<'? # I? ^^-ft^.-.T 

^ ^ T T ^ ^ pre. £ 

^ i Ji ® T :^ pre. a ( (m*K> 346-6 ' 

j^^'h it^^y (295)6^ 

^ ^ ^ 

l^n Si. t4 ti il ^^l ^i^i^) 

^ ^ m ^ '4 !^ 
.^'1 a (i Si mnn^^ 

^II S ^ ^ pre. ^ 
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JU. 5»J3 



AiCi#^it^tt&^ ' i^Bmr-^n^it^i^^ ' i^jib^^oA > B > c 

t ' A > CA£^ ' fliB^'J;!:^^^ o B^6^1i75;l:^i<f,^® j:tfl-#;tJ^l5] 
®i:t§s^,^-^t:i'^#^^fi^jlC^^^BA > BC^ ' ^^f^^.tAB^BC^n 

A^^#6^ # sji, 2.10^ ITl sji, l.ll^i'Xt^^ill^^tfA^fcbife ' isj^^^j 

itm^BCA^^Rtcmnn-^yiii'f'i^.m^^t^^j^ » ® 

i:: *sji rc *sji 

ill : ^ *sjid 1^ *sjin 

^: Ra5 *'sji > 'chi *sjir)> shing 

: *sij > se *sik > sac 

X : njij )f| sjij 

ill: ia njir" 

^ ' 3' nye shes 

: ^0 ni d5 si ' 



11 M±t^m^^^^^Kf\^mf}^^y^iiitmmi%7)^^^^^^{\97l)}f\ -d ^ 
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?I t 0 



1189 i±^ii^)^'^nm^^n} ° 

1983 {y^-:^^^} ' ib:?: : t ° 

1964,1966 <®i:t§cD-5?f^> $-41964 ' 1^^41966 ' tt^ '. f'Ht^° 
1975, 1976, 1977 (S^^^^lt) I 1975 ' II 1976 ' III 1977 ' t^if^ : 1c^[5 

1971 {Jl-^^!^^-^) ' (;t^^^> i?/f^4$- ^ -^ft^^'] ' 1-611 ° 

1986 im-^^"^} ' : f^i:A.l^tbHg.4i ° 

1987 <®i:f§f§;i^-f f-1^^*?f5£> ' -^^V-t4*t^5£^^^ii^r^ ' til 

1190 <#;||^a#f tfi^» ' a^l>ili9244^#-«^P^ ' ^^>*^ : ^^"^tPl^^m 
f'Ht ° 

T^ + ra^l^ra^^^ ' 935-9681 o 
Gong, Hwang-chemg 

1988 "Phonological Alternations in Tangut," BIHP 59.3:783-834. 

1 989 "The Phonological Reconstruction of Tangut through Examination of Phonolo- 
gical Alternaions," BIHP 60.1:1-45. 

1 994 "A Hypothesis of Three Grades and Vowel Length Distinction in Tangut," 
Journal of Asian unci African Studies Nos. 46-47. 

Kepping, K.B. 

1979 Sun' czy v Tan^utskom Pcrevotle, Moskva: Izdatcl'stvo Nauka. 
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1983 Les Kategorij Utracennaja Kitajskaja Leisu v Tangutskom Perevode, Moskva: 
Izdatel'stvo Nauka. 

Kepping, K.B., V.S. Kolokolov, E.I. Ky6anov, and A. P. Terent'ev-Katanskij 

1969 More Pis'men ( ) , 2 vols., Moskva: Izdatel'stvo Nauka. 

Nevsky, N.A. 

1960 Tangutskaja Filologija, Issledovanija i Slovar', 2 vols., Moskva: Izdatel'stvo 
Vostoinoj Literatury. 

Sofronov, M.V. 

1968 Grammatika Tangutskogo Jazyka, 2 vols., Moskva: Izdatel'stvo Nauka. 
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mmm^mmmwmmmmm (1994.7.18-20 ^ m%nm) 



^ffl it] (HIRATA Shoji) 



nmmm^±m^mm "mm^" m^Wio m^um, 5i*^wfitj*^-» 
m^m mmmm, m^mm^Mo mat, m^m.mmMmm - 

mt" s (3) "mmm" . (4) "m («^ia) " mmmmi^.o sMm 



m (The Mitsubishi Foundation) AX*4^ef^ffijR5c • ^aJ^^C^^^TlJ^f^^lS)^ 
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M^tfiE (1 928 : 27-28) JSm5^f^^ig3l^6<3i4W. JiHiTTffi^^^^S&ij^a : 
J-^mnn^. »S^(1984.U W!8lJ^(1982). $11(1986). W§J^(1987)^o ^ 

^mmmmmm^m^mis^-m^mo mmmm^^M^mw.-^^ 

mm^im-mmm. mM^^^^mm^it, 

«S5!tf«&<3-f@^^^]^o (165) 

mmtmmp t k smmm, [b d g zmmrn. mm-^niKm. mj,^m 
&^^mmu'^'^itLm^Tm-^%m : /p ■ f t t- n i ts ts- s t<i w i> <i 

k k' 0 h/ 
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2. -n^" mm'^m'M 

2.1. "^5^" fn+*3?^6^j -mm" 

(1) 4^*^}S3?#«rtgmw^«^w (voiced) etjifift. {iiiM^^BiT 
Wim^m "tense/lax" ^ «)ttb«f^^»i^9iis:^i "^vs^m^t" nmmmm 

(2) "±tf/?^?f mmmmmmm^m^ cgiottaiized) /# 

(non-glottalized) " ^ ^l^»^mS^ : (^tt) 6^+*^^^®- 

im±ummm) mmit. mmmmM.mt (breathy) 
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(D^±m'^mnti:m^ (fortes) muw cwf^^ig) . ^mT-^^m^± 

(5) "mm- mmmm^mnw^^mmmmmm (#0)) . anti^rtsi 
nmmm^nwi -5^ (tenseness) " ^i:WL^mM.^^, ^^i7wm±m^n 

mil^^f^mn'^ Clenis) o 





glottalized 


non-glottal ized 


tense 


± m 


)k m 


lax 


± m 



(-) ±^ /pa33/ iJjff /p'a33/ ±M /ba33/ 

[pa33] [p'a33] [p^all3] 

(-) ±^ /pa33/ /p'a33/ ±M /pall 3/^/p- all 3/ 

[pa33] [p'a33] [pall3]g4[p' all3] 
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H ^±m^n^mmijm^mmm (HashiBotodseo)) 







glottalized aspirated voiced 




V f ■ trfr 

±m 


+ — — 

— + — 

— + + 




±m 


(+) - . + 

— + — 

— + + 


mm 




+ — — 

— + + 

— + + 


trAf III 




+ _ _ 
— + — 


iiiii 


±m 


+ _ _ 
+ — — 




±m 


+ - ( + ) 
— + — 
+ - ( + ) 
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(6) m^i^mmhm^±m. ^km. ±m^m^nmmm.mnfji -^-^sg^mw 
2.2. -m^" SEmm:^t . 

: 131-132) w^. Tgm^mk^mm^m^sc^mM.^mm'^mM^mmu. if# 
¥iiis m^m^mm. mm -^m. d^m. ±m" ^m^nmnrnT-m^m 



3. ^mmmm "u^" 
3.1. j^mmu 

3.1. *^«3?^«;:mft 

i-. m^^mSi "#«^»tOTS§ (Princeton Hypothesis) " ^°&^tt}^S,^o 
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p 


p' 


P* 


p' 




p 


p' 


bfi 


bfi 


mm (Mil) 


p 


b 


b 


b 


mm (^tsc) 


p 


P' 


p' 


P* 




p 


p' 


P* 


p' 



(1) mM (mm) cMTTtms.^ Li^Ttnom > i^-. -mm^'^^mi^m, 
M) o -m mm - . (mm - r.'^mmihmma^mm^, R^^in 

(2) R^+mmmmmmmmwmA {^m^^my . Tmic immy 
^m-^^-m"^. mm^^m^fumm^o #iiffl(i 988 : 73)o 

(3) maisMsgmw. +^w^i^mmmtijm^±mm'^. mm^-mmfH 

M^mWo l¥M,5ljtIJ5!t(1981). im^(1987)o 

(4) ^w.'^tmmn^^m^mmmu.m^. #^^t-(i 986)o m^mmm 
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3.2. *i):7fs?^m 

3 . 2 . 1 . if Am "*ij^m3?«m" mmm 

(lus)pffmm.. ^ixmi^mmu^iLW^m'^.- M'f. u^. ma. ^^;?s8:it€ 

inoto(1960)s Sagart(1984)s Sagart(1988)s i^±'^{ini)^^^^MM^Mtm7^o 

(1) fn "iJcmm- 

"d^m'^mit" ^mm^^d:mmmi^m^^ (lems) muw (Mtc 

Hashiii.oto(1960)I^MJWs MM&^J "i^mWm" S^fSTO "^m/^rf s ^ 
m" "glottalized/non-glottalized" S^J^al. Ig^lillJfif tbi^- 

(2) ")cgtt)iliE (hypercorrection) " 
Sagart(1984:89-91)JIili6tJi^?&±6^ "ji^fijiiE (hypercorrection) " Uo 

ggProto-northern Gan" . ilfttAfi^lgM-MgimJl^jl^^ifSl^r^ms ± 

iK^j^mm^mw, mji«tmig. ■^mmm^, "Mttmigproto- 
southern Gan" o ^mmMm'^m^mm^i^ . ^tmi^^±m., mimijm 



ERIC 



-226- 



23/ 



® Sagart(1984) 





±m \km ±M 






p mm p' p(i i^is 






p mm p- i^is P/pfi mm 
i i I 

p mm p- 1^1^ p- Pii^ 
I I i 

p Pit@ hmm hmm 


eE:lk:^B^4 b > p/pfi 
ffl[p']^-^^*ife6^[pfi] 

hypercorrect ion 


(2) Sagart(1988)6ij(i^EM, 




±m r^m ±m 






p mm p' Pile b mm 
i i i 
p Pits hmm hmm 


i)Si:lkig (PNG) 

hypercorrect ion 


mt) 


p HIS p' mm pfi Piii 
1 i i 
p p- i^is p' mm 

i I 4 

i>mm hmm hmm- 


mtijBmi. h > pfi 

hypercorrect ion 




p Pi3@ p- mm pfi Pits 
1 i i 
p Pits p' mm p' Pits 


m^mm (psG) 
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( 3 ) mnmm 















> 













n i}km) M-^iiATo " fplA^(1989 : 774-775 )XM3it: "^kmitM" tt 
3 . 2 . 2 . i^ln 

ia±HMfSf?prisEtM^o ^ii, mm^m\^i^^n < i ) s^j^is -urn 



4. M¥^3LJe^ "Jg^" 

4.1. mmm^ 



ERIC 



-228- 



233 



(1) •Mmmn Mm^mm. *^^ii¥iii[b~pfi~p-], iiKiiEp] eg 

6:16-17)o 

(2) mm^^^ ^mmmw. "^u. 
mtmn. 'm pa" ~ ba^. 

,11 ^.,11 m tsao" ~ 



dzao 



tqiao 





m¥ 




Pi* 


PiA 




p 


b 


P 


b 


m^mm 


p 


b 


b 


b 


Mm 


b 


P 


b 


b 


±^ 


P 


b 


b 


b 




P" 


b 


b 


b 




P 


P 


P 


P 



m kioD " - gioo " 
i^^(1 985:4)) , [^7^;} o 



4.2. &1ig*^-^3l§4>lifmm§6i)M 

Nor«an(1 991:346)&tj^lt, i£3^f^&1ig, m=^^mm±^m^. SJ^^^SM^ 
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20-30% \tmmmM.m^m^mo Tmm^^-TmM.m^mm^±mm^^m$m 



4.2.1. ^m^m^m 

M^M, #¥ffl(1988b:313)o 

±M ?p ?t ?k — + — ^ — 

p- f k' - ? + ^ — 

b d g + ? - 1^ (R^EiiMii^aiJi) 

bfi dli gfi + - + Pi b d g 



EH ?P ?t ?k - + 

p" f k' - - ? 

b d g + + ? 

pfi tfi kfi + - 



- 1 b d g 16 
+ 1 pfi tfi kfi 16 

- fS ?p ?t ?k Ji 

+ (Hi£iil9HtrtS?l±5JlN) 
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-m- . m^mummmmwio w^^^ -ig- "5^" . m.^m^^si-z 
mnmm^m. '^mmmm "mm- m "m^- ) ^^mmm "mmm- 

4. 2. 2. 

¥ paD-2 «*b) • level, flat"" 

phaQ-2 (<*bh) 'to roll (cloth) siooth' 

M . tik-8 «*(!) ■ straight" 

thik-8 (<*(lh) ■ to coib out straight" 

mnmmm^m-^. mmmm^mm^\ (m^g:^(i99i:9i3-9i4). mmmi 

990)) . H^^^^flJcHim^mMiiTl^W^tllto 



5. ^SB "u^" &imi 
mm±mim, ^mvmm. ^^mMmmm.. mm -n^- mt^^^m^ 

lSm^^Iti5'» . «eii7Cfi(1935)m^lg^#fi^t±m. Pj|ft3fe^(1991:213-21 

mmw^$., #^i¥ffl (1983-84) 

m-Ji Ba^o feiA^(i989)g^^. m±^ft+«E 

iitlSam m#^^^-»5 fiJt^o ¥Lij(1960:22-24)Xl'm it "^3? 



ERIC 



-231- 



242 



^mit mmtiip t] > mat^m ui 

[th tsh] > [h s] 



^ &ii ±^ &ii 
^^%mm % ^ n %/u 



mim^ (Shizuoka Hypothesis) " o t^5t^^-i^[6]fpI:^$^iMfi^lJ[, X 
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4-5)?g{±l. "^^5t^" 6^ "fg5^&4)^SiHM±t!ljgfi5teE3?«^ 3219^ 

¥LU(1960)fi{J:i:35: (W^^^aSl) M^^sE^^^o R^¥liJ(1968 : 145)@ 

^Uj(1960 : 9)|Ji, "tense/lax" SfflT R. Jacobson and M. Halle(1 956). 
Fundamentals of Language &^^^o 

^Jj (1960: 11)^, jltfiigifei^f R. Jacobson 6^ "checked/unchecked" o 
li^(1983 : 60)tilffil^o (19 60 : 1 1) tSJiMjcffi (1 928) ti»f Til^^M S 

¥Uj ( 1 9 6 0 : 1 3 ) l?.M@f^-l^:^M^m^ig&^:i:v53?^Rff [ h] ^T&mMJii 
(1 98 9) -«:^i';^-bM^Jo 

¥Lij(i96o)T®Mitt^i^:'«5?«sg "iswrntt" mmim^ijmmfm, »o 

||S^(1987 : 85)|Ji: " "S^MSf^* ?^IS4^Wtil^^m&^. ilflS^M 
1ST -mmmmtr mmiSim^-, 2^M»Ballard(1971 : 148)o 

sagart(i984: 9i)s.m iLmmmM}f^=f-mm%mmmm^',mmm 
:^mmm^mijmM.mt (breathy) m^^^mmm^mo 

#^S^(1986). ]^7Gfi(1948). te@M^(1986)o 

#¥ffl(i983-84)o &^^^mmmmmijm%mm?injSLmmm^mm w 

S??^^(1984). W*^fe(1987). ^,£^(1988). m^^lC^^m^^'^ imS) . 
PMI)[(1989)^o 

"T^. "M^tr afemramo mm m^) <^^> = 

^^mmy *i9i) . (^§i> ¥5i$feS^«M: h^^iw^m^x" (4^ 
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wimmmik^^^m-Mmu-mmmm^m (Dantsuji(i984)) o 



Ballard, Williat L. (1971). "Scenarios of Change in Chinese Dialectogy", 

Acta Linguistica Hafniensia 13.2. 
JfeifM. 01^(1986). {mWUm^'^U) . iljm} 1986. 4 : 273-276. 

w^j^(i982). {nmmi^±m^nmm'^mm . {^mmsui 1982.4:273-278 
. — (1987). {Wiwm^'^-^m^^mim . i^mmio im.2 ■. 101-109. 
w*^Ik(i987). {^mimijmmtmnu) . <mmm^\ im.i 85-101. 

(1990). {^m-^^ljmnM^nmH'^) . iUW) 1990. 1 : 42-43. 

Mjuffi [Y.R.Chao] (1928). (^f^^igfitlSff^) .^11^*$^^ 

— (1 935). ('pmumn'¥mm'^mmm) . i^mmmiy 5.4.515-520. 

M7uffi^(i948). iMZUmm^n^} , ±m ■. mmwm. 

m^^. fflifHii(i99i). (mmmm^imm) . i'pmmw^m 4 : 212-230. ;it 

mitMam). {mmum^^m^M.'^) . imm^^y 1988.1 : 131-134. 
— (1989). (mmum. m-^m. ^w^mmm^mm^mm^) . {mmm 

^) 1989. 1 : 1 13-1 19. 

isg?js(i990). .mm^mi^m^t 

Dantsuji. Masatake(1984) . "A Study on Voiceless Nasals in Burnese". Sta- 
dia Phonologica 18. 1-14. 
ffS)i(1984). (^nUmmmm^M) . C^D 1984. 2 : 109-127. 

si*ifeS4(i98i). <+~m^^mtmmti7'^ (i^nu-^) ) . m^mmm^pfr^ 

ffj) 52. 1 : 37-78. 

Hashiioto. Mantaro [*8*Si:J«5] (1960). "The Bon-shio (^:^) Dialect of 
Hainan, A Historical and Coiparative Study of Its Phonological Struc- 
ture. First Part : The Initials". <Ml§iSff^> 38 : 106-135. 
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— (1989). (mfik^mjkmmmm) . 60.4 : m-m. 
¥fflit](i982). (mmijm^±is^9inmf^^) , imtt^m n 33-51. 

(1983-84). (^Sf««^*a^> . im±^l&} 14: 18-30 : 15:22-26. 

— (1988a). imtijM "mAm" 6^ttff> . 1988.1 : 12-24. 

— (1988b). immmitijmo^nsimi^- mt^] msmmi^^imn^m 
©{Sift-) . («g5&©»Pc3e> . :5?l5±^A5:^}-*ef^Rff. 305-328. 

— (1968). • S±#©liSf:gt«j©^St--^''^-c> . <a 

^"^M^^^y 20 : 140-151. 
^iJt-(1 986). (WSgfi<J^ia («) > . C^W) 1986. 4 : 253-261. 
115^(1984). (^^ (Tm :^ti^Mfi<J*W> . C^W) 1984. 1 i 47-50. 
(1987). C^m^m^^mP^^^n) . C^W) 1987. 1 : 81-96. 

^11(1966). {umm^'^^tiT) . imic^m) , mm^^m. 

— (1986). iu^um^-mrr mi&m&m^''mrr > . <:^w> i986.2 :io6. 

^.^t$(i940). . Menm 

-^^^^(1991). <i^j«igftijis> . imi^^n^mti. 

^^:fh(1984). {^mUmmm^m) . <lgt^lS^> 12 : 66-83, HP. : ^ 

Norian, Jerry. (1991) . "The Min Dialects in Historical Perspective". Wang 
(1991) : 325-360. 

Mt(i99i). {mnijmm^'^^M) . iijm 1991.1 : 30-39. 

Sagart. Laurent(1984). "How did the aspirated stops becoae voiced?". Com- 
putational Analyses of Asian & African Languages 22 : 87-93. 
(1988). "On Gan-Hakka". <tfll^?B> iil8. 1 : 141-160. 

tfc$ift(i988). inmmmt^m'^t^m) . {^mi^j&y .nz-in,±mm.-niiimt 
^ i^(i 983). mmimm'&mu^nwL) . <ggw9F^> i983.i : 49-81. 
iiiBaB#ii(i988). (i5cjig)|f4J-ii^asg£®«Jf^> ,w.p:imi. 
m^. Pt.4.i^(i 993). immiimm^} . im ■. ^im^iamt. 
im^(i987). {[unm^ijwm^mMm'Mnzij^^) , (i^aig:^) 1987.1 : 

24-33. 

Wang, Willia* S-Y. (1991). Languages and Dialects of China. Journal of 
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Chinese Linguistics, Monograph Series No. 3. 

m ^{lui). {tmumm^wL > . <:^i> igse.i : 19-38. 
Wi^^iim). {^j.i^mm:^mmmm) . <€?-7K^4:^E^®iS5:m> . 1-35. 
— (1 989). {^^mmijm^±m^nmmit) . oiw^> 7.2 : 41-73. 
is!p*^(i 984). {^m.■^\h15m'¥m^^MM=^) , i^pmrn^} im.i ■. 357-358. 
^^p^(i96o). mmijwmm jm-.^^^^^mmt [^^(i983)m-ii5] . 

Yue-Hashiioto, Anne(1976). "Southern Chinese Dialects : The Tai Connec- 
tion", Computational Analyses of Asian & African Languages 6 : 1-9. 

(1991). "The Yue Dialects". Wang(1991) : 294-324. 

?l3t^(1 98 9). mn:^Wm^n) . iUm 1989. 1 : 40-46. 

?i?«B^(i986). mm±Mi^mijmmm'^^^m'm^nmit) , <^i§i^^> : 

140-15 3. ±7S«i^WaijlStt. 

»5ii^^^(i 985). (m^-^mmmm^) . itm i985.i : 39-45. 

(1986). (t^WlJmm'^m im) > . {IfW} 1986. 1 : 8-18. 

— (1988). (mwm±Mijm^m^mmm'^^mb^ 7dm) . 

232-23 7. ±Mm^iiim±. 
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1. 

MH{@^®J{§f?iag)J^mmmM(language typology)6U^®^^ ' ^HIS^® 
M!l5S#ll?^#{@^e^ri(iinguistic components)/t^61]Siljfta'^ ° 

mm^nm±^l^J^ ^^W^WLj (syllable-counting)^^5a'S&5fi# ' ^ 
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(1) ^m^mm±M. 

(2) /jN^a/jNffl^^Mii 

(3) 

f^fef^^HJ! ' n^Um^mn^^mmmWminc^yy stress Rhythm):tSO 

238- 

249 
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(4) ( Nasty : 

Who's that thief in nasty thoughts? 
Who's that in that nasty calls? 
Who's that eat that nasty foods? 
Who's that came to my nastic room? 

mt^^M^iffSMM ' 1^3. 2# o -mU^ ' filtWti5(stressed syllable)^ 
ft&S^^^SSi^M ' #m0[WS5(unstressed syllable)mf.&^^i^M ' #{0SW 
Itmm^^^mm'^m'^n » mM^ra^P^CTiming Interval)MimSS*@^S^ ' 
JtbfiP^iifi^ ""^^Wf^j (isochronous movement) ° [SH] ar1I^S^±^l5- 

(5) (1 : 5li-3fT : S^) 

^ -* 

^ 

^ ^ ^ 

' M(5)6^iii^)ii»-i@i^t?!i ' n'B,'^mfm%iE^^^'^x^m 
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(6) m^MM: : Il4-5fj : 19^) 

(7) (MMS : Mnn : 

m^i^mnmm i^immnw^m mimifr^^mm 



^i^i-tjj^^]^ , $ngg^(1975.1979) » M#M5t(1977,1979) ° ^ffl^MTS^^ 

mmmit : ( 2 = w^) 

(8) (IMS : Miff : M^) 

222: 2 2 

/\ /\ /\ /\ / \ / \ 

mM. mm % • mimi^ mm^mm 

(9) (M^iiS : Hiofr : 

2 2 2 2 2 2 

/ \ / \ / \ /\ /\ /\ 

m^mM ¥«tig^ts K^mm tmrn- 
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^H^©--® ""SI^Mj (tone language) ' m^MMB ""MW^b j 
(stress language) ' ^SM^^HaW^^^i®^ ' ^M-:^65W^5!jtgiPW 

(10) im^mM : His- i7fi^ ; mm) 



' aWftr1l^^ifM^^6^?!jtb(phonetic function):^^^Eg^:^® 
: ^iPiJn5SWtf565»S(loudness) ^ JtiJnWtf56^:gJt(length) ^ ijn^»l^61]5Hi|l; 
(contour range) - J^i.S.^if »|^65W15i(pitch height) ° [S2] M(10)65 ""tg 
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^ti#6^mW(heavy stress)mRrtg}t^jgf^iS65M3!llSli^AJg : 

(11) (1 : ^i-3fT : fi^) 

2 2 2 2 

^ ^ ^ ^ 

2 2 2 2 

^ ^ ^ 

2 2 2 2 

^ ^ ^ ^ 

# ' Rrtg^TBB-^m^mwm^^ ^mwm^^ (code-swuching) » ^i^sm 

(left- prominent) » ""^Mj (right-prominent) » jtS^^fi^MW 

ti5#mt£t£l^$!l7^^65il^^«iS : (N = i^S) 

(12) (^^5i5:m2-3fj: M) 
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'■^fSi^j (default tone) ° ISS^SWi^l^Sli^f^Ccomplementary 
distribution) » M#Ktg^«ISII<4g ' =^#l!lfilf5^ailWifJ ° MWS^*!^^ 

Sl^^m:^^^^^ '■jgf^SISmWj (contour toone language)^ ''H 
^SI^^Hj (register tone language)M^i ° rl^liSS^Ef^Sii^ e » "1:1^ 

(13) (MONKEY^E^W : ^SlS I 119^) 
M 

M 

M 

(14) (MONKEY^EaW : l|6|9! ; 10^) 

M 

M 
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M 

m^Bik » ESI^raff:t1^^®M6^ESll?^jtg(distinctive function) ° ^^fg 

(15) m^mmm : m4is ; mm) 

H — L- M H — L- M 

n mmmmm n mmmm 

T H — L- M H — L- M 

T T H— L-M H— L-M 

n mmmmm n mm mm 

T T T H— L-M i i i i H— L-M 

n mmmmm n mm mm 
» i!£^cM^-{@/>^ii5MSif (H) > i&ih) > ^^iM)^mmm^mf^mi^^mmw 

^Si^(register)6^®^:&il^lXf^7ie{^»M(contour tone)S<J?^tg ° » 
♦ ^i5lPf^^(function word cliticization)^^§(6^^#^5g(metrical tension) 
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(16) ' Hii-isfr : 



s s 2 2 

/\ /\ /\ /\ 

mm mm^ mm 



AP NP AP NP 



(17) : ^24-26fT : mm) 




'^-^mw^m-ummyym > m^]p\-^mm3m::-m<i>mi ° if^^ (i99i)-fe# 



ERIC 



-245- 



2bB 



(18) 2 2 2 

/ \ /\ / \ 

\/\ / \ / 

NP PP VP 



6. g^^^^^^fiD^^-^^^iiJ 

p^cfa , :gS^E^|SC6^*&#4^ ' ^^±.m ""iS^lf^^j (immediate 

(19) (M^ • 5i8-9ff ; mW() 

2222 2222 

/\ /\ /\ /\ /\ /\ /\ /\ 

\/\/\/ \/\/\/ 

AP AP N AP AP N 



(20) (M^M • ll29-30fT ; ^^) 

2222 2222 

/\ /\ /\ /\ /\ /\ /\ /\ 

mmm mm im m'^u mm 

m^s mmm mm ^^.w^ mm m^t. mm 

\/\/\/ \/\/\/ 

AP AP N AP AP N 
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(21) m^M : min ; lam) 

2 2 2 2 2 2 2 

/ \ /\ / \ /\ /\ / \ /\ 

mx #^5'Jts M f^M Jg^::g mm 
\ / \ / \ / 

QP Adv WH 

(22) mmt : misfj : mm) 

2 2 2 2 2 2 

/\ / \ /\ /\ /\ /\ 

\ / \ /\ / 

Adv Adv N 



7. ?^P^6UM^®5^ 

^j(22)6^i^)tm#i^^^^Miigii±6^a^ ' ^mmm^mm=^^mi^ 

(23) (M^ : 1^14- i5fT : 13^) 

(a) 2 2 2 2 2 
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(b) 2 S S S S S 

(24) (li^m : ||26fT : S^) 
^1!^^ E ^ 

mmm'?-mTm^±mmnmmjt^ * ^^^jmmm ^m^^mmj (stress 
clash) ° mmmmp^m (i9n) > MM(i983)^#iiai » ^^;tM^fi#^s 

(25) : llifr : S^) 

s s s s s 

/ \ / \ 1 1 1 

m'aMm mm. n % m. 
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lM^MIIMffiS(1993) - SMS^1M(1993,1994)^A^M ''Sffi^ffi 
J (universal grammar)^ig^fi^ ''S'Jii^j (constraints)>iiN^tb^^^>®/> 
(minimal)^SS^^S o MrfH^m. » ^-{imWI^AMCinpuDWi^E&S^CGEN 
fit0J[I-m ■"I^Sfirffimj (candidate outputs) » ^4^^S^i£^i^a^f Ui^llc 

M§^i>Mi^6^ii^t±jfiiiPM ^mmmmmj (optimal output) » utipmisfi^ 

mm^MWij (Optimality Theory) ° '^mm^mmmmwr : [i£+] 
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(27) 



X 

is. 




(28) ^^^Mmimm^M^^mm 



(29) 2 2 2 22222 
/ \ /\ /\ 1 1 1 

\ / \ / \ / 

N WH V 
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*:^:xmm'^n^mn'^mmnwm^^m'^j^^ » ^wmmo. 83-o3oi-h 

-004-115 ' mitmmmn'^^^'ij-^n ° @^ lr#ff lSC^(Professor 

Alan ?rmcc)mmum^noTms: ' lik&mn^nBm^xmmMm 

Generalized Alignment - ° ^^^JilEt^^^^mm^SFfB^Sm^JI 
(1958) - ^iBmtimi) - EKl975,1979) - M(1979) - M^(1982) - 



(1987)^^ o 

SEg : ^^j 6^{^#^±IIi^^^^H^(L. A. Boy Z) ' 

i±£ : IISS$>?1l^mW6^^W?^jtgif #KME3t(i976) ^ ^l(i982) ^ m3M 

(1985) - fe^^ (1990) - 1^^^1M(1994)#^ ° 



^M^(1973) ° 
(1994) ° 

: mi^in mMS!^»J^±^6^#®(major pause) ° ^^S^^liin 

tgif#K£»(1990) ' g^fi^^l^in ^?^tgif #M1^(1993) ° 
: ICW^&^iil£^P#MM(1984) ^ ^^(1986,1990) ^ l.^.M(1991) 



: *^®B}{f ""SitlSI^^Si^j (Optimality Theory)^A^tf ' Bmmm 
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Causative Compounds across Chinese Dialects: 

a study of Cantonese, Mandarin and Taiwanese 

Lisa L.-S. Cheng, UC Irvine, 
C.-T. James Huang, UC Irvine 
Y.-H. Audrey Li, USC 
C.-C. Jane Tang, Academia Sinica 

1. Introduction 

One of the controversies associated with resultative verb compounds (RVCs) centers 
around the level at which the causative RVCs are formed. There are three different approaches to 
the formation of RVCs: (a) a lexical approach (Li (1990, 1993) among others): causative RVCs are 
formed in the lexicon; (b) a syntactic approach (Huang (1992a)): causative RVCs are derived 
syntactically; and (c) a mixed approach (Cheng 1992): there are two types of causative RVCs, 
lexical and syntactic causatives; the former derived in the lexicon and the latter in the syntax. A 
related issue of RVC formation is the question of how close the representation of the compounds 
should reflect the meaning of the compound. 

In this paper, we discuss causative RVCs in Cantonese, Mandarin and Taiwanese. We 
show that the difference in the formation of causative RVCs between Taiwanese on the one hand 
and Cantonese and Mandarin on the other is reflected in a restriction on the defmiteness of the 
postverbal object NP. We argue that the difference is a result of different levels of causative RVC 
formation: in the lexicon in Taiwanese and in the syntax in both Cantonese and Mandarin. We 
further show that the lexical derivation of causative RVCs in Taiwanese is part of its overall 
"analytic" nature by discussing constructions involving the light verb Do as well as typical 
causative constructions. In comparison with Cantonese and Mandarin, the more "synthetic" 
dialects, the representations in Taiwanese are rather "transparent" with respect to its meaning 
composition. 



2. Causative constructions 

As frequently noted in the literature, RVC formation is very productive in Mandarin 
Chinese. Long lists of such compounds are readily available, such as da-si 'hit-dead'^ ^ , qi-lei 
'ride-tired'*^ ^ , ti-dao 'kick-fall'^^ #i , zhui-lei 'chase-tired'il S , qi-si 'angry-dead'^S, ft , lei-si 
'tired-dead' ^ ^ , zui-dao 'drunk-faW'^ M etc. Such productivity is shared by many other 
dialects of Chinese. For instance, all the examples just mentioned have the exact counterparts in 
Cantonese and Taiwanese. 

Syntactically, these RVCs can be either intransitive (i.e. in the pattern [NPi RVC]) or 
transitive (i.e. in the pattern [NPi V NP2] ): 

(1) Tahe-zui-le. fe'^iSfl unergative 
he drink-drunk asp 

'He drank himself drunk.' 

(2) talei-si-le S 7 ergative 
he tired-dead- ASP 

'He is extremely tired.' 

(3) Ta da-si tamen le. ^ & ^nl transitive 
he hit-dead them asp 

'He hit them dead' 
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(4) Zhe-jian shi lei-si tamen le-ii-f^- # ^ ^Lifc^ 7 causative 
this-CL matter tired-dead them asp 
'This matter tired them to death.' 

The two transitive patterns are further divided into transitive and causative, as shown in (3) and 

(4) . (see Cheng and Huang 1994 and later discussion in this paper for the distinction between 
them). These sentences have their exact counterparts in Cantonese: 

(5) keoi jam-zeoi zo 
he drink-drunk asp 

'He drank himself drunk.' 

(6) ngo mun-sei la 

he bored-dead part 
'He is extremely bored.' 

(7) keoi da-sei-zo keoidei 
he hit-dead-ASP them 
'He hit and killed them.' 

(8) li-ceot hei mun-sei keoidei 
this-CL-movie bored-dead them 

'This movie caused them to be very bored.' 

Taiwanese also has the counterparts (as shown in (9)-(l 1)), except, unexpectedly, for the causative 
counterpart in (4), as shown in (12) (see Hsieh 1993): 

(9) i lim-tsui a. 
he drink drunk ASP 

'He drank himself drunk.' 

(10) i thiam-si a 
he tired dead asp 

'He is extremely tired.' 

(11) i phak-si in a. 
he hit-dead them asp 
'He hit them dead' 

(12) *tsit-tsan taitsi thiam-si in a. 

this-CL matter tired dead them asp 
'This matter tired them to death' 

The contrast between ( 1 1 ) and (12) shows that it is not tht '-ase that Taiwanese simply does not 
allow the RVCs to take a postverbal object. Instead, it appears that the "transitive" use of the RVC 
contrasts with its "causative". Following Cheng and Huang (1994), we assume that there are two 
types of transitive RVCs: one with the first verb of the RVC denoting activities and the other, 
states/change of states. The first type, illustrated in (3) and (11), indicates that some action of an 
agent results in a theme being in a certain state (for instance, the action of hitting is done by the 
agent / 'him' in (1 1) resulting in the theme in 'they' being dead). The second type, illustrated in 
(4) and (12), denotes a causer (NPl) bringing about a causee (NP2) being in a certain state. In the 
case of (12), tsit-tsan taitsi 'this matter' is the causer and in 'them' is the causee. Cheng and 
Huang call the type illustrated in (1 1) "Transitive" and (12) "Causative." The subject of the 
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Transitive type is an actor and the subject of the Causative is a causer. Pertinent to our discussion 
here, the Transitive constructions allow a postverbal object but not the Causative constructions, as 
illustrated further by the following examples: 

(13) a. i tsau kau in tshu a 

he run arrive his home asp 
'He ran and arrived his home.' 

b . i tshit tshingkhi hit-tsiak to a. 
he wipe clean that-cl table asp 
'He wiped that table clean.' 

(14) a. *tsit-tsan taitsi kiaN tsau hit-e lang a. 

this-CL matter scare away that-CL person asp 
'This matter scared off that person.' 

b. *tsit-kuan tsiu tsui to hit-e lang a. 
thls-cl wine drunk fall that-CL person asp 
'This bottle of wine made that person very drunk.' 

These examples seem to suggest that Transitive constructions involving RVCs can be found in 
Taiwanese as well as Mandarin and Cantonese. However, Taiwanese, in contrast with Mandarin 
and Cantonese, does not have Causative constructions involving RVCs. This is not quite correct, 
however. Complicating the issue is that sentences like (12) and (14a-b) can be acceptable, if a 
different type of postverbal objects is chosen: 

(15) a. tsit-tsan taitsi thiam si (tsit-tun) lang a. 

this-cl matter tired dead one-pile person asp 
'This matter tired many people to death.' 

b . tsit-tsan taitsi kiaN tsau (tsit-tun) lang a. 
this-cl matter scare away one-pile person asp 
'This matter scared off many people.' 

c. tsit-kuan tsiu tsui to (tsit-tun) lang a. 
this-cl wine drunk fall one-pile person asp 

'This bottle of wine made many people very drunk.' 

Comparing (12) and (14a-b) on the one hand and (15a-c) on the other, we note that the minimal 
difference between the two sets lies in the type of the object NPs: in the former set, the NPs are 
definite expressions (pronouns and NPs with a demonstrative); whereas, in the latter set, the NPs 
are non-definite expressions. The following generalization thus emerges: 

( 16) Postverbal objects of Causative constructions in Taiwanese cannot be definite. 



3. Postverbal constraint on definiteness 

Generalization (16) at the first glance appears to be quite idiosyncratic. However, the 
literature does not seem to lack in similar observations. In fact, generalization (16) reminds us of a 
broader postverbal constraint in Mandarin Chinese discussed in, for instance, Li and Thompson 
(1981), Huang (1994) and Tang (1990) concerning sentences containing a postverbal object NP 
and a duration (D) or a frequency(F) phrase. The pattern [V object D/F] requires the object NP to 
be definite (a referential NP in Huang's term): 
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(17) a.*Wo kan-le (henduo) shu liang ci/liang-ge zhongtou. 

I read- ASP many bock two times/two-CL hours 
'I read (many) books twice/for two hours.' 

b. Wo kan-le naben shu Hang ci/liang-ge zhongtou. 
1 read-ASP that book two times/two-CL hours 
'I read that book twice/for two hours.' 

The effect of definiteness (referentiality) of the object NPs on the acceptability of the 
sentences is not only manifested in the Mandarin [V object D/F] constructions, but also in other 
phenomena in many other languages such as word order variations in Hungarian and agreement 
requirements in Hindi. In Hungarian for example, a sentence with a non-referential object occurs 
in an SOV order whereas the neutral order for a sentence with a referential object is SVO, as 
shown in (18) (see Maracz 1989 among others). 

(18) a. a fiu levelet ir (SOV) 

the boy letter-ACC writes 

'The boy is writing a letter.' (The boy is busy letter-writing.) 

b . a fiu ir egy levelet (SVO) 
the boy writes det lettt ACC 

'The boy is writing a [specific] letter.' 

c. a fiu irja a levelet (SVO) 
the boy writes-Agro the letter-ACC 

'The boy is writing the letter.' 

Similarly, in Hindi, a sentence with a definite object NP shows object agreement on the verb 
whereas indefinite and non-referential object NPs do not trigger object agreement on the verb 
(Mahajan 1990): 

(19) a. raam-ne kitab paRhii 

raam-ERG-(m) book read-PERF-f-sg 
'Ram read the book.' 

b . raam ek kitab paRhegaa 
raam-(m) a book read-fut-m-sg 
'Ram will read a book.' 

Mahajan considers that a definite object NP must move into the Spec of an object agreement phrase 
(AgroP) but an indefinite opr non-referential NP must remain as sister of V. 

In the spirit of Mahajan (1990) among others, Huang (1994) proposes to account for the 
contrast in (17) in terms of the base-generated position of object NPs. In particular, a referential 
(definite) object NP is base-generated in the SPEC of VP (sister to V) (in the NP2 position in 

(20) ) and a non-referential (non-definite) object is generated as sister to V (in the NP3 position in 

(20)):' 



1 A subject may be base-generated as the SPEC of VP (the Internal Subject Hypothesis, see, for 
instance, Fukui 1986, Koopman and Sportiche 1990 among others). If this hypothesis is adopted, 
we will need more layers of VPs in the structure (see Larson 1988). However, it docs not affect 
the main poiiit here that a definite NP is base-generated in the SPEC position and a non-definite NP 
is base-generated as sister to V. 
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(20) IP 
/ \ 

NPl r 
I w 

/ \ 

NP2 ^v;^ 

V NP3 



To illustrate with the sentences in (17), the definite object naben shu 'that book' in (17b) occurs in 
the SPEC of VP (NP2) position. The D/F phrase can occur in NP3 position (see Larson 1988). 
After V raises outside the VP, sentence (17b) will be derived. On the other hand, if the object NP 
is indefinite (non-referential), it is generated in the NP3 position, which is competed for by the D/F 
phrases. (17a), thus, is not possible. The contrast between (17a) and (17b) is thus a manifestation 
of the constraint on the position of object NPs: 

(21) A definite object NP occurs in the SPEC of VP position and a indetlnite object NP occurs 
within V (as sister to V). 



4. Analysis 

With (21), we can proceed to account for the generalization in (16) which prohibits a 
postverbal definite NP in a Taiwanese Causative construction. Along the lines of Hsieh (1993), 
Huang (1992b), Wu (1994) and Zou (1993), we take (22) to be the structure of a Causative 
sentence (such as (ISa-c)):^ 

(22) ^IP^ 

NP2 VV 
/\ 
VI VP2 
I / \ 
CAUSE NPS V2' 

/ \ 

V2 NP4 
I 
> 

tired-dead 
scared-off 
drunk-fall 



For the sentences in ( 15a-c), the object NP tsit-tun lang 'many people' occurs in NP4 position, 
since it is indefinite. V2 moves up to Vj Cause and combines with it to become a causative verb, 
deriving the well-formed sentences in (15a-c). 



2 The compound verb thiam-si 'tired-dead', kiaN-tsau 'scare-off and tsui-to 'drunk-fall' may be 
further analyzed as consisting of two VPs, which will not affect the analysis here. 
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Turning to (12) and (14a-b), the causee is a definite NP. It should be base-generated in the 
SPEC position, NP3. Verb movement (V2 to Vi) applies, as in the case involving indefinite NPs 
since it is an obligator}' process to create a causative verb. This movement, however, would create 
the verb chain [V], V2]. The niinimal domain for the causative verb would therefore be VPi, not 
VP2. That is, with respect to the object NP position constraint, we can no longer consider only 
V2, instead, we need to consider the chain [Vi, V2]. In other words, the Spec position that matters 
is no longer NP3 but NP2 of VP]. After verb movement, the definite NPs in (12) and (14a-b) 
occurs within the projection of Vi' rather than outside of the Vy, violating the constraint on where 
a definite object NP can occur as stated in (21). Note that V -movement to Cause does not create 
problems for (15a-c), since the indefinite object NP are still within Vi'. (21) thus accounts for the 
contrast between (12) and (14a-b) on the one hand and (15a-c) on the other. The generalization in 
(16) is captured. 



5. Towards dialectal differences: Syntactic vs. lexical 

The discussion so far, however, raises the question of why the counterparts of the 
Taiwanese (12) and (14a-b) in Mandarin and Cantonese are acceptable, as illustrated by the 
acceptability of (4), (8), especially considering the fact that the postverbal definiteness constraint 
applies in Mandarin and Cantonese as well. An answer to this problem may be found in the 
discussion in the literature concerning the level at which a causative compound verb is formed. 

Note that the analysis of the Taiwanese causative constructions assumes that the causative 
verb formation takes place at the syntactic level: V-movement takes place at the syntactic level, 
creating a structure where the definite object NP is within V, rather than the SPEC of its V. In 
other words, in Taiwanese, the RVC thiam-si 'tired-dead' is only a resultative verb in the lexicon 
and its causative counterpart is derived in syntax. On the other hand, the causative verb formation 
may take place at the lexical level, as suggested in Li (1990) among others. In other words, a 
surface verb such as 'tired-dead' may in fact be ambiguous: it can be the resultative 'tired-death' or 
it can be the causative 'CAUSE + tired-dead'. The former has one argument [theme] and the latter, 
two arguments [causer, theme]. The causative verb, with its two arguments [causer, theme j, can 
be projected into the following structure, just like any two argument verbs such as 'hit' or 'eat': 

(23) IP 
/ \ 

NPi r 
/ \ 

I ^VP^ 

NP2 V' 
/ \ 
V NP3 
I 

[CAUSE+tired-dead] 



The causer argument is in NPi position. The theme argument is in NP2 or NP3 position, 
depending on whether the theme NP is definite or not. 

We suggest that Cantonese and Mandarin causative RVCs are derived lexically. That is, 
there is no verb-movement to a Cause verb in syntax, as we have seen in Taiwanese. In contrast, 
a sentence such as (4) in Mandarin or (8) in Cantonese will be generated in exactly the same way as 
a sentence like 'John hit Mary' or 'John ate lunch', since the causative verb is treated as a lexical 
item taking two arguments. Given a structure such as (23) for the causative sentences in (4) and 
(8) as well as typical transitive sentences, the definite object NP will be generated in NP2 whereas 
the indefinite object NP will be in NP3. Since there is no further VP projection to "extend" the 



ERIC 



•260- 



^ ( I 



domain of the verb, NP2 will remain as the Spec of the VP even after the verb raises out of the VP 
to Infl. This entails that what we are dealing with in the case of syntactic causatives (the Taiwanese 
case) is a VP-shell (Larson 1988), the lower VP is part of a bigger VP. In contrast, with verb to 
Infl nnoverrent, we have a simple case of verb movement not involving VP-shells and the status of 
the NP positions does not change. 

To sum up, Taiwanese derives the causatives syntactically and therefore given the 
extension of VP domain in a VP-shell, postverbal definite NPs are not allowed. In contrast, 
Cantonese and Mandarin have lexical derivation of causatives and thus the syntactic restriction of 
definite object NPs is always obeyed. 



6. Further evidence 

In this section, we discuss two additional differences between Taiwanese and Mandarin. ^ 
We show that both differences can be treated on a par with the analysis that we have proposed for 
the causatives. 

6.1. The verb(s) DO ^ 

Consider the sentence in (24) in Mandarin and its Taiwanese counterpart in (25): 

(24) ni ku ni de ^. 51 6<j 
you cry you de 

'You go ahead and cry (and it doesn't concern me).' 

(25) li tsoli khao -ft. -Si: 5C 
you do you cry 

However, these two ways of expressing a rou|;hly similar proposition are not interchan gable in 
these two languages, as shown by the following two ungrammatical sentences. 

(26) *nj zuo ni ku i^-^ ^ % % 

you do you cry 

(27) *li khao li e % 

you cry you E 

We propose that the distinction exhibited in (25) vs. (26) and (24) vs. (27) centers around the overt 
and covert light verb Do in these two dialects. 

Consider first how (24) is derived. Following Huang (1992b), we propose that Mandarin 
>^as a light verb Do which takes a gerundive IP: 



3 Cantonese counterparts are exactly the same and wc will not discuss them here. 
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In (28), due to the presence of the empty light verb Do, the gerundive verb ku 'cry' moves to 
replace Do via the lower gerundive I. And due to the gerundive nature of the IP, Infl cannot assign 
nominative Case to the NP in its Spec, ni 'you', the genitive Case marking is applied and we have 
an output (24). 

The Taiwanese example in (25) represents an overt realization of the light verb Do. In this 
case, the light verb Do happens to be homophonous with the regular verb tso 'do'. If we replace 
the empty light verb in (28) with the overt light verb tso 'do', we obtain an output such as (25). 
Due to the presence of the overt tso, there is no need for movement and by Economy of Derivation, 
movement cannot take place. Thus, a sequence such as (29) is ruled out: 

(29) * li tso khao li (e) ^ 51 #. ( 6<) ) 
you do cry you E 

Furthermore, (27) is not possible because Taiwanese does not have a covert light verb Do on a par 
with the one in Mandarin. Then a structure such as (28) with a null light verb with subsequent 
raising of the verb will not be possible in Taiwanese and hence the ungramjnatical (27). 

We have seen that in (25) that there is no genitive Case marking on the lower // 'you'. We 
propose that the overt light verb tso 'do' exceptionally Case marks (ECM) the subject // 'you' of 
the gerundive clause. There is thus a difference between the overt light verb and the covert light 
verb: the former can ECM the lower subject whereas the latter cannot. Furthermore, in the case of 
Mandarin with a covert light verb, after the movement of tiie lower verb to the light verb, there is 
still no structural Case assignment. This is due to the fact tiiat verbs such as ku 'cry' does not have 
a structural case to assign since it is an unergative verb. 

These cases can be compared with sentences involving an object in the gerundive clause, 
as shown in (30) and (31). 
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(30) a. ni kan ni de shu 

you read you DE book 

(i) 'You go ahead and read.' 

(ii) 'You read your book.' 

b.*ni zuo ni kan shu 
you DO you read book 
'You go ahead and read.' 

(31) a. li tsoli thaktse 

you DO you read book 
'You go ahead and read.' 

b. li thak li-etse 
you read your book 

(i) 'You read your book.' 

(ii) *'You go ahead and read.' 

The contrast in (30) is the same as the one shown between (24) and (26). Note that even though 
the verb kan 'read' can assign accusative Case, it has only one Case to assign (to its object shu 
'book'). Thus, after it raises to the covert light verb Do, it no longer has any Case to assign to the 
gerundive subject ni 'you'. It should also be noted that we are specifically talking about the _ 
interpretation of (30) as indicated rather than the possessive reading of ni-de shu 'your book', 
which is also possible. This is in fact the only possible reading for (31b). Thus, (31b) is not a 
counterexample to our claim. The impossibility of interpreting (31b) as (30a-i) is due to the same 
reason as the impossibility of (27). 



There is one further contrast between Taiwanese on the one hand and Mandarin on the 
other which supports our claim that Taiwanese is more 'transparent' than Mandarin and Cantonese. 
This involves the contrast shown in (32) and (33) involving the fco/fca-construction. 

(32) a. lanka i pa si 

we BA him hit-dead 
'We hit him dead.' 

b. lanka i pa ho (i) si 
we BA him hit give him dead 
'We hit him dead.' 

(33) a. women bata da-si le 

we BA him hit-dead asp 
'We hit him dead.' 

b.* women ba ta da gei (ta) si 
we BA him hit give him dead 
The contrast shown between (32b) and (33b) shows that the causation expressed by pa-si hit- 
dead' can be "spelled out" in a transparent way in Taiwanese but not in Mandarin. In (32b), we 
see that pa-si 'hit-dead' can be further "divided" into pa-ho-si with the causative being overtly 
expressed. However, as shown in (33b), this is impossible in Mandarin, showing that such 
relations can only be expressed in a covert way in this dialect. 



7. Conclusions and Theoretical Implications 
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In this paper, v^t have studied an area of comparative grammar across three Chinese 
dialects: Mandarin, Taiwanese and Cantonese, and showed that the observed systematic 
differences among these dialects in the syntax of causative sentences and other related 
constructions can be described with considerable insight within a formal model of Universal 
Grammar and linguistic variation. In particular, treating dialectal variations as instances (on a 
smaller scale) of normal linguistic variation, we have assumed that the computation system of a 
language is invariaiit across languages and dialects, the seemingly radical superficial differences 
being reducible to the morphological or lexical variations among them. In particular, whereas all 
dialects compared have a lexicon that contains RVCs, only Mandarin and Cantonese have lexical 
causative compounds. (Pure) causative compounds in Taiwanese must originate in the lexicon as 
ergative (inchoative) compounds. Their causative use is permitted only when an ergative 
compound is underlyingly embedded under an abstract verb Cause, to which the ergative verb 
compound must raise. This causes a definite/referential object to fall within the domain of a VO, 
thus exhibiting the definiteness effects observed in this paper: 

(34) a. *chit tsan taichi thiam-si hit-e lang a. 

this CL matter tired-die that-CL person part 
'This matter caused that person to be tired to death.' 

b . *hit kuan chiu tsui-to Li SiansiN a. 
that bottle wine drunk-fall Li Mr. part 
'That bottle of wine got Mr. Li to be so drank as to fall.'. 

(35) a. chit tsan taichi thiam-si go-pa-gua lang. 

this CL matter tired-die 500-plus person 

'This matter got 500-plus people to be tired to death.' 

b . hit kuan chiu tsui-to chin-choe lang. 
tat bottle vine drunk-fall quite-many person 
'That bottle of wine got many people to be drunk and fall.' 

No similar definiteness effect is observed in Mandarin or Cantonese because the causative 
compounds may be lexically derived, and hence are not embedded under Cause, and hence a 
definite object in the Spec of VP would not be considered to be within V due to verb to verb 
movement in VP-shell structures. 

(36) a. zhe-jian shi lei-si nei-geren le. 

that-CL matter tired-dead that-CL person part 
'This matter got that person tired to death.' 

b. nei-pingjiu zui-dao-le Lisi. 
that-CL wine drunk-fall-ASP Lisi 
'That bottle of wine got Lisi so drunk as to fall.' 

1 here is also no definiteness effect If an overt causative verb appears above the ergative compound, 
since an overt verb like ka, ho, hai 'cause' takes a proposition (a clausal category) but not an event' 
(an ergative VP) as its complement and does not force the definite object to be a complement of a 
(complex) VO. 
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(37) a. chit tsan taichi ka/ho/hai hit-e lang thiam-si a 

this CL matter cause that-CL person tired-die part 
'This matter caused that person to be tired to death.' 

b . hit kuan chiu ka/ho/hai Li XiansiN tsui-to a. 
that bottle wine cause Li Mr. drunk-fall part 
'That bottle of wine caused Mr. Li to be so drunk as to fall.' 

With respect to defmiieness effects in causative compounds, then, Taiwanese is 
characterized as being more of an analytic language whereas Mandarin and Cantonese are more 
synthetic. This contrast is corroborated by other comparative differences in periphrastic causative 
constructions, and in the syntactic manifestation of the semantics of Davidsonian action sentences. 
Thus, whereas in Mandarin action sentences the verb typically undergoes head-movement into the 
position of a higher light verb, thus obscuring the existence of an "event place" in the logical form 
of such sentences (as in (38)), in Taiwanese the light verb typically manifests itself overtly, as in 
(39)): 

(38) a. ni san ni-de bu, wo shui wo-de jiao. 

you take your talk, I sleep my sleep 

'You go on with your strolling, and I kept on sleeping.' 

b . ta ku ta-de, wo shui wo-de. 
he cried his I slept mine 
'He kept crying and I kept sleeping.' 

(39) a. li tso li san-po, gua tso gua khun. 

you DO you stroll I DO I sleep 

'You went on with your strolling, I kept on sleeping.' 

b. li tso li khao, gua tso gua khun. 
you DO you cry I DO I sleep 
'You keep crying and I will go on with my sleep.' 

In the analysis of each of these differences, we have assumed that the dialects under consideration 
differ only the contents of their Lexicons, but share a Computation System that operates according 
to general principles throughout these dialects. (In the causative cases, the existence of an abstract 
Cause and the absence of pure causative compounds distinguish Taiwanese from Mandarin and 
Cantonese. In the case of Davidsonian action sentences, the existence of overt light verb and its 
absence again cuts across these dialects. These results seem quite desirable and optimal, in the 
sense that our theory of linguistic variation makes use of little more than what appears lo a "virtual 
conceptual necessity" (that languages clearly must differ in their morphologies), and it seems 
possible to assume that language variation is reducible to, and in fact limited to, morphological 
variation. This conception of parametric theory is clearly more optimal than one that directly 
stipulates, say, the existence of definiteness effects in certain grammatical constructions in one 
dialect but not in another, or that of a given head-movement process in the computational system of 
one language but not another. 

In other words, on a descriptive level, we can state the generalization, based on our 
analysis, that Taiwanese is more "analytic" and more transparent, and Mandarin and Cantonese 
more "synthetic" and more opaque, in that more goes on in the lexicon in Mandarin and Cantonese 
than in Taiwanese. But from the point of view of a more restrictive parametric theory, this 
generalization can be reduced to mere morphological differences among languages, in particular, in 
the distribution of certain grammatical lexical items. 

Indeed, this "minimalist" parametric theory also appears to be the most optimal when it 
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coii^es to the major differences that distinguish among languages of different typological types. 
One well known typological difference among languages is the existence of "Wh-movement" in the 
formation of constituent questions. In early linguistic literature, this typological difference was 
directly taken to reflect in a variation in the design of the computation systems of individual 
languages: some languages possess the rule of "Wh-movement" and others do not, this in turn 
following from the elementary assumption that languages may differ in the distribution of certain 
substantive and formal constraints. A parametric theory of this sort, however, went beyond 
observational adequacy. As Huang (1982) shows, this conception of the typology of constituent 
questions misses important generalizations about the cross-linguistic similarities and differences 
with respect to subcategorization, scope interpretation, and movement constraints (Subjacency, 
CED and the ECP). Huang's suggestion was to conceive of the w/i-movement parameter in a 
different way: all languages share the substantive universal of having a w/i-movement rule, but 
differ in where that rule may apply: if not in overt Syntax than in Logical Form. The Hypothesis of 
w/i-movement as a substantive universal explained the similar properties shared by wh- 
constructions across all languages, and their differences in where the rule applies account for 
observed differences among these languages with respect to locality constraints, etc. This 
conception of the typology of w/i-constructions enjoys a level of descriptive adequacy that 
previous conception did not in that it captures certain linguistically significant generalizations that 
might have been treated as accident properties of languages. This conception of parametric theory 
is not optimal, however, since the parametric differences, being in terms of the components of a 
computation system where a given rule may apply, relies on an assumption that is not itself of 
virtual conceptual necessity. Furthermore, although the issue of leamability does not arise, it is not 
explained why w/i-movement may apply overtly in English, but only covertly in Chinese- rather 
than, say, the other way around. 

More recent work offers a promising line of inquiry that has the prospect of attaining 
explanatory adequacy. One line of research, undertaken in Cheng (1991), relates the lack of overt 
w/i-movement in Chinese-like languages to the existence in them of certain functional elements, in 
particular, question particles occupying the position of C in syntax. This assumption explains the 
clustering of properties in one language and their joint absence in another, and is relatively optimal 
in that it reduces superficially vast syntactic differences to a morphological difference in the 
distribution of certain functional categories. In current work, furthermore, Tsai (1994) proposes 
that the obligatoriness of overt w/i-movement in English, and its obligatory procrastination until LF 
in Chinese, can be directly tied to a morphological difference in the intemal structure of the wh- 
words themselves. In English, w/i-words have a microscopic syntax with a self-contained 
operator-variable structure; they are therefore inherently interrogative operators, and hence are 
subject to movement, given the general assumption that operators must occur in operator position, 
with expected locality effects. In Chinese, on the other hand, wh words are open categories, i.e., 
polarity items that are underspecified for their interrogative vs. quantificational features. As such, 
they are not inherently identified as operators, and not subject to overt syntactic movement. Their 
interpretations are determined by the licensers that c-corrjnand them elsewhere within a sentence, 
outside of their intemal structure. Thus the w/i-words are on a par with variables that are 
unselectively bound in the sense of Heim (1982) (cf. Lewis (1975)). In the case of the 
intenogative interpretation, it is assumed that the w/i-in-sitxi is bound by a (base-generated) null 
operator. The vastly different syntactic difference between Chinese and English thus boils down to 
the difference of the possibility of base-generating a null operator, of whether the null OP is in the 
lexicon of either language. Chinese has null operators for all operator positions, but English has 
none, except hose cases where the null operator is strongly bound (in the sense of Chomsky 
(1986), i.e.. Ill parasitic gap constructions, tough constructions, certain relatives, etc.). 

This difference in the presence of a non-strongly bound null OP has further implications. 
For example, it also underlies the "null topic parameter" of the sort described in Huang (1984) 
concerning the distribution and interpretation of certain null aigumenis in Chinese and German. 
The theoi7, which is in spirit a minimalist theory of linguistic variation, thus only explains why 
English and Chinese should differ not only with respect to the existence of overt w/i-movement, 
but also with respect to the distribution and interpretation of certain null arguments. 
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Two additional theoretical implications of our analysis are worth mentioning: First, our 
analysis supports the traditional distinction between resultatives and causatives, against recent 
attempts to treat them uniformly. As S. Huang (1974) argued, both the resultatives and causatives 
carry with them the semantics of causation, but a distinction is still necessary, between what he 
terms "event causatives" (resultatives) and "factive causatives" (causatives). Recently, Sybesma 
(1992) argues that both construction types (what he calls "canonical resultatives and causatives ) 
should be treated alike, as forming a typical ergative-causative paradigm. What we have shown 
here is that the three dialects under consideration do not differ with respect to their syntax ot the 
resultatives, but do so with respect to their syntax of causatives. For us, the resultatives constitute 
an unergative-transitive paradigm, whereas the ergatives and causative compounds consUtute a 
separate paradigm. The resultatives have an inherent semantics of causation, but they do not have 
a syntax of causation; only the (pure) causatives do. Our analysis, if correct, thus provides 
important evidence in defense of the traditional distinction, against the uniform-treatment 

hypothesis of Sybesma (1992). .^^ .... , ^ 

Finally if our analysis is on the right track, we have provided additional support tor the 
hypothesis, advanced in Johnson (1992), that there is a process of movement that invariably raises 
the verb out of VP into a higher head position, as a universal pnnciple and irrespective ot the 
morphological properties of the functional projections of a particular language (such as the French- 
English contrasts of the sort considered in Emonds (1976) and Pollock (1989). This assumption is 
necessary to ?ilow for cases of grammatical resultatives and (lexically denved) causatives taking 
definite postverbal NPs as their objects. The existence of such a process in Chinese has also been 
demonstrated in Huang (1992b) and in Kung (1993) in accounting for the definiteness effects in 
connection with the occurrence of objects with certain duration and/or frequency expressions. 



4 See Cheng and Huang (1994) for additional arguments in favor of the traditiona; distinction. 
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A Syntactic Typology of Formosan Languages 
~ Case Markers on Nouns and Pronouns 



Paul Jen-kuei Li 



1. Introduction 

1.1 Theoretical Considerations 

Studies of language typology can certainly enhance our 
knowledge of language in general. These studies are closed related 
to studies of language universals; see, for example, Greenberg 
(1966), Comrie (1989). Moreover, studies of language typology are 
also related to studies of language genetic relationships, esp. to 
the problems of subgrouping. 

What significant features should we look for in language 
typology ? Is there a hierarchy (i.e. degrees of significance) in 
typological features ? Is so, what is it ? Questions such as these 
can be raised and discussed. 

1.2 Previous Works on Formosan Languages 

Most of the previous publications on Formosan languages deal 
with an individual language or dialect, while only a few are cross- 
Imguistic studies. The few cross-linguistic studies are either 
phonological, e.g., Tsuchida (1976) and Ferrell (1979b), or limited 
to a single syntactic feature covering only a few languages. For 
example, Ferrell (1979a) compares only the construction markers in 
several Formosan languages, while Starosta (1974) compares only the 
causative verbs in six Formosan languages: Amis, Bunun, Rukai, 
Saisiyat, Sediq and Tsou. More recently, Starosta's (1988) paper "A 
Grammatical Typology of Formosan Languages" is much more compre- 
hensive. It deals with several syntactic features, including word 
order, topicalization, auxiliary verbs and pronoun contraction. 
Illustrated with examples from eight Formosan languages: Amis, 
Tsou, Saaroa, Atayal, Sediq, Bunun, Saisiyat and Puyuma. However, 
there is limited data in the paper, due to limited space and 
limited field work that has been done on some of the languages 
Moreover, these studies may be somewhat misleading as they are 
often based on very innovative dialects, such as the Squliq dialect 
Of Atayal, which have lost many important grammatical features. 

1.3 PurpoF.e of This Study 

In this study we shall try to cover as many Formosan languages 
as possible. We shall pick a more conservative dialect for each 
language. Ideally a great many syntactic features, including case 
markers, personal pronouns (both long and short forms), tense and 
aspect, focus, word order, topicalization, nominalization, 
relativization, mterrogatives, negatives, imperatives, and so on 
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should all be examined, as most of these features are closely 
related. In so doing, we can compare the syntactic similarities and 
differences among Formosan languages. Such a cross-linguistic study 
will have a bearing on their genetic relationships and may resolve 
some problems of subgrouping. 

To take noun phrase as an example, nouns and pronouns are 
marked for case in the.^e languages. How many different cases are 
there in each language ? In some languages like Sediq and Saaroa, 
there are only two different cases for nouns: nominative and 
oblique. In some other languages, there are three different cases 
for nouns: nominative, accusative and genitive. In still some other 
languages like Amis, there are four different cases: nominative, 
accusative, genitive and locative. In fact, even more different 
cases may be distinguished in a few languages such as the Mayrinax 
dialect of Atayal. Some of these languages distinguish between 
common noun and personal noun as in Philippine languages, while the 
others make no such distinction. All Formosan languages can be 
divided into two main types, as based on whether such a distinction 
is made. 

In western Austronesian languages, a relative clause can only 
modify the subject of the sentence. Such a restriction has been 
suggested (Ross 1994) as inherited from Proto-Austronesian. Some 
Formosan languages observe the same restriction, whereas in a few 
others such as Tsou a relative clause can also modify an object. 
Again, all Formosan languages can be divided into two main types, 
a;:; based on such as distinction. Furthermore, we may object to the 
hypothesis that only the subject can be relativized and that it is 
a syntactic feature attributable to the stage of Proto- 
Austronesian. 

If we compare Formosan languages type by type, we can see 
which of these languages share more syntactic features with each 
other and which of them share fewer. 

In this paper, all examples for all Formosan languages are 
based on my own field notes, unless indicated otherwise. 

2. Case Markers on Nouns and Pronouns in Formosan languages 

Like western Austronesian languages, in most Formosan 
languages a noun is modified by a case-marking particle, which may 
or may not distinguish between a common noun and personal name 
(including a few kinship terms). All personal pronouns are 
inflected for case, and there may be different sets of case, 
including nominative, accusative, genitive, locative or oblique, 
depending on the language or dialect. 

The term "case marker" is called "construction marker" by 
other authors such as Ferreil (1979a), Tsuchida (1976, 1980). 
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2.1 Atayal 



The Mayrinax dialect of Atayal is conservative not only in 
phonology and morphology, but ' also in syntax. It retains many 
grammatical particles that have been lost in the Squliq dialects 
(see Egerod 1965, 1966, 1993). It has an obligatory case marker for 
each noun in every sentence, as in: 

Al. hibag-un na? buli? ni? yaba? ku? qulih. 
cut PF Ins knife Gen father Nom fish 

'The fish was cut with a knife by a father.' 
A2. kabaux cu? pila? ki? hakiq ?i? tali?. 
AF-borrow Acc money Acc name Nom name 

'Tali? borrowed some money from Hakil] . ' 
A3, situil] nku? ?ulaqi? ga?, rahuwal cu? matanah. 

clothes Ben child Top big Acc red 

'As for the clothes for the child, they are large and red.' 
A4. si-baiq mu cku? ?ulaqi? ku? pila?. 

BF-give my Dat child Nom money 

'The money was given to the child by me.' 

Mayrinax has the following case-marking particles: 

Nom Acc Gen Ben/Agt Dat Ins Loc 
Common: ku? cu? na? nku? cku? na? i? 

Personal: ?i? ki? ni? ni ni? 

Squliq has lost the case markers 212 and jkil, and merged rial and 
ni?. In other words, it has lost the distinction between common 
noun and personal noun. Moreover, most of the case markers it still 
retains are optional. In addition, it has undergone some sound 
changes, including c > s and k > g. Compare the case markers in the 
two dialects: 

Mayrinax: ku? cu? nku? cku? na? ni? ?i? ki? 
Squliq: qu? su? nqu? squ? na? na? — 

Moreover, Mayrinax has a much more elaborate pronominal system 
than Squliq (see Egerod 1965, 1966). Like most other Formosan 
languages, there are both long and short pronominal forms in 
Mayrinax. 





Predicate 


Nominative 




Genitives 






LV)ng 


Short 




Long 


Short 




?ikuil] 




Vi. 


Vt. 


Vi. 


Vt. 


Isg 


kuil] 


cu 


ci 
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nha? 
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There are two different sets of nominative short forms. 
Roughly speaking, one is for transitive while the other is for 
intransitive. Such a distinction is made only for the first and 
second persons singular. 

Similarly, there are also tv/o different sets of genitive short 
forms, transitive and intransitive. The "intransitive" genitive 
forms are used to modify a noun, e.g., pila? mu 'my money', while 
the "transitive" genitive forms are used to indicate a non-Agent- 
focused verb, e.g., ras-un mil 'brought by me', p-in-ana? ti' 
'carried by us (inclusive)'. 

Like all the other Atayal dialects such as Squliq, in Mayrinax 
cami 'we (exc) ' is used instead of cu or ci 'I' in such an 
expression: 

A5. ma-tutil] cami ki? ba?ay. 'I fight with Ba?ay. ' 

AF-fight Nom/we Acc name 



2.2 Seediq 



Seediq has only two case markers, ka 'nominative' and na 
'oblique^. Both markers are most often optional. There is no 
distinction between common and personal nouns. The genitive marker 
na right after a noun indicates possession or an Agent in a non- 
Agent-f ocused sentence. 



'Men are good. 



Dl. malu ka rseno. 

good Nom men 
D2. mu-nu-bahal] ku wada huqin ka pawan. 
AF-Perf-hear Nom/ I Asp dis Nom name 

'I heard that Pawan had died.' 
D3. su-sapo na lupi ka laqi na tama. 
'BF-lay Acc mat Nom child Gen father 

'Father laid a mat for the child.' 
D4. tama su-sapo na lupi ka laqi na. 

father BF-lay Acc mat Nom child his 

'Father laid a mat for his child.' 

There ar^four different sets of personal pronouns in Seediq: 
nominative, genitive, accusative and locative. Only the genitive 
has both long and short forms, and the nominative has only short 
forms, whereas the accusative and the locative have only lonq 
forms . ^ 

Personal Pronouns in Seediq 



Isg 
2sg 
3sg 
line 



Nominatives 
Lonq Short 

ku? 

su? 

ta? 



Genitives 

Lonq 

naku? 

nisu? 

neheya? 

nita? 



Short 
mu 

su?~sa- 

na? 

ta? 



Accusatives 

Lonq 

yaku? 

?isu? 

heya? 

?ita? 



Locative 

kenan 
sunan 
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lexc nami nami nami yami mian 

2pi namu namu namu-ma- yamu munan 

3pl ndeheya? na? deheya? 

Unlike Atayal, Seediq uses the first and second persons 
singular in such an expression: 

D5a. kuxun su-mu. 'You are my love = I love you (sg) . ' 

love you my 
b. sunkuxun ku isu. 'I love you (sg).' 

AF-love Nom/I Acc/you 

Atayal and Seediq are closely related. Yet they differ in that 
the former distinguishes between common and personal nouns, whereas 
the latter makes no distinction. Apparently the distinction has 
been lost in all Seediq dialects, just as in the case of the Squliq 
dialects of Atayal. 

2.3 Tsou 

Tsou has four different sets of case markers: 

Nominative: ?e, si, ta, ?o, na, co 
Accusative: ta, to, nca 
Genitive: no, ci 
Locative: ne 

For detaild descriptions and discussions of the syntactic and 
semantic functions of these case markers, see Tung (1964:147), 
Tsuchida (1976:94) and Zeitoun (1992, 1993). Given below are 
examples to illustrate the case markers: 

Tl. man?i ?e i-si eaa to c?oexa ci eosk9 ta oko. 

many Nom PF he PF-catch ne river Gen fish Acc child 

ta 
no 

'The child caught many fish at the river.' 
T2. ?o oko mo ticunu to mo yuso ci meoi si eosk9. 

Nom child Aux catch Acc Aux two Gen big Nom fish 

'The child caught two big. fish.' 
T3. smoa-zom9 ne fuel]u ne tasiona ?e amo. 
AF-shot bird Loc mountain Loc morning Nom father 

'Father shot birds in the mountain in the morning.' 
T4. sia na suu ? 'Who are you ?' 

who Nom you 

T5. mio coI]o co t?aI]o-?u. 'My leg hurts.' 
Aux hurt Nom leg my 

No distinction between common noun and personal noun is made 
in Tsou. 
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Like most other Formosan languages, Tsou has both long (free) 
and short (bound) pronominal forms. The long forms can occur freely 
like an ordinary noun, but they can only be optionally preceded by 
a nominative marker na, and nothing else. A short form is attached 
to a preceding auxiliary verb or noun. 

T6. os-ko eoobak-a (na) a?o. 'I was hit by you (sg.).' 

PF you hit PF Nom I 
T7. ta-?u m-imo to chumu. 'I shall drink water.' 

Aux I AF-drink Acc water 
T8. oh-ta ima-a (na) ?o emi. 
Aux he drink PF Nom wine 

'The wine was drunk by him before.' 
T9. i-si p-tol]s-a ta ino-si si oko. 
PF he cause-cry-PF Acc Mom his Nom child 
'The child was caused to cry by his mother.' 



Personal Pronouns in Tsou 





Free 


Bound 
Verbal 


Nominal 


isg 


a?o 


?u~?o 


?u 


2sg 


suu 


su~ko 


su 


3sg spec 




ta 


si 


gener 




si 




line 


a?to 


to 


to 


lexc 


a?mi 


mza 


mza 


2pl 


muu 


mu 


mu 


3pl spec 


hin?i 


hin?i 


hin?i 


gener 


hee 


he 


he 



2.4 Kanakanavu 

Kanakanavu has the following two (or three) sets of case 
markers (Tsuchida 1976:36-17, Mei 1982): 

Nominative: sua, sa, si 
Oblique: sua, sa 
Locative: na 

The disticntion between nominative and oblique is rather 
obscure since the same markers, sua and s^, are used for both 
cases. Moreover, these case markers are optional. 

Knl. kanakanavu sua caau iisua. (Tsuchida 1976:36) 

Nom person that 
'That person is a Kanakanavu.' 
Kn2. ni-m-ia-pacai sua caau sua tutui na tau-canum-a. (Tsuchida 
Perf-AF kill Nom person Obi pig Loc water 1976:37) 
'The person killed a pig at the place to draw wather. ' 
Kn3 . ni-m-ia-pacai avia tutui. (Mei 1982) 
Perf AF kill name pig 
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'Avia killed a pig.' 

There is no distinction for a common noun and a personal name 
as illustrated in Kn3 above. ' 

Kanakanavu also has both long (free) and short (bound) 
pronominal forms. ' 



Isg 

2sg 

3sg 

line 

lexc 

2pl 

3pl 

Kn4. 

Kn5 
Kn6 

Kn7, 





Personal Pronouns 


in Kanakanavu 




Bound 








Free 




Nominative 


Genitives 




Nominative 


Oblique 




I 


II 


III 




ku,kia 


aku 


maku 


naku 


i iku, iikia 


?ikua 


kasu 


su 


musu 


nsu 


iikasu, iimukasu 


kasua 






iisa 


nni 


iisa 


iisa 


kita 
kimi,kia 


ta 


mita 


nta 


iikita 


kitana 


mia 


mia 


nmia 


iikimi , iikia 


kimia 


kamu 


mu 


mmu 


nmu 


i ikamu , i imukamu 


kamua 


— , m-ini 


ni,kiai , ini 


ini 


r) uani 


?inia 



niani l]anai su ? 
what name your 
tia ku mi-ia-pacai caau. 
Fut Nom/I AF kill person 
TRa-licdpd kasu ?uucu ? 
AF fear Nom/you ghost 
c-um-aca?ara kasu ?ikua. 
AF see Nom/you Obl/me 



'What is your name ?' 
'I shall kill a person.' 
'Are you afraid of ghosts ?' 
'You see me. ' 



2.5 Saaroa 



There are only two sets of case markers in Saaroa; 
Nominative: ka, a (wa, ya^ 
Oblique: na 

All of them are optional (indicated by parentheses) in the 



'The dog died. 



Srla. maci?i (a) tasau. 
AF-die Nom dog 
b. naci?i (ka) tasau. 'The dog died.' 

Sr2 . um-au-a-u a mamaini na vutukulu. 
AF R eat Nom child Obi fish 
'The child kept eating fish.' 
um-a-ala cucu?u na vutukulu na luulul) . 
AF catch person Obi fish Obi river 
'The person caught fish in the river.' 
t-um-a-tutulu a ina-ku .na mamaini na kari. 
AF- teach Nom Mom my Obi child Obi language 

'My mother taught language to a child.' 

Like Tsou and Kanakanavu, Saaroa does not distinguish between 



Sr3 



Sr4 
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a common noun and a personal noun. The same case markers may 
precede both nouns. For example, 

Sr5. ii-m-ari-vak3s9 ka al]ai na mamaini. (Ting, MS) 
Perf AF beat Nom name Obi child 

'A]}ai has beaten a child.' 

Somewhat like Kanakanavu, the nominative marker ka in Saaroa 
may precede an Agent in a non-Agent-f ocused sentence: 

Sr6. sa-alu-a ka cucu?u kana?a ka vutukuiu. 
RF catch Obi person that Nom fish 
'The fish was caught by that person.' 

Similar to Tsou and Kanakanavu, Saaroa has both free and bound 
pronouns. The general free forms may appear as the topic or object 
of the sentence, while the two sets of bound forms are suffixed 
either to the verb as the nominative marker or to the noun as the 
genitive marker. In addition, the genitive bound forms may combine 
with na- as free forms. 

Sr7. iiaku k-um-ita vuii?i. 'As for me, I saw a snake.' 

Top/ I AF-see snake 
Sr8. ii-k-um-ita-aku iiau. 'I have seen you.' 

Perf AF see Nom/ I Obl/you 
Sr9. kc«ni?i ia isikana-ku. 'This is mine.' 

this Top possession Gen/I 



Personal Pronouns in Saaroa 





Topic 


Nominative 


Genitive 


Oblique 


isg 


iiaku 


-aku 


-ku 


na iiaku 


2sg 


iiau 


-u 


-u 


na iiau 


3sg 


iiaisa 




-isa 


isana 


line 


iiata 


-ita 


-ta 


na liata 


lexc 


iiaiamu 


-amu 


-iamu 


na iiaiamu 


2pl 


iiamu 


-mu 


-mu 


na iiamu 



2 . 6 Ruka i 



The Tanan dialect of Rukai has the following case markers (Li 
1973:84-94) : 

sa Accusative common noun 
ko Nominative personal name 
ki Accusative personal name 

RTl. aw-baay nakoa sa omas ?asilalak ko tina-li. 

AF give Acc/me Acc person adopt Nom Mom my 
'My mother gave me away to a person as an adopted child. ' 
RT2 . ko maLeQa, "tobaasa ina kensas sa gooQ " amia ki doLay. 

Nom name cook soup police Acc cow so Acc name 
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'MaLeQa said to DoLay, "Cook beef soup for the police !' 

Tanan distinguishes between a common noun and a personal noun, 
as illustrated above. The distinction seems to have been lost in a 
closely related dialect, Budai, in which both ko 'nominative' and 
ki 'accusative' occur before common nouns and personal names, as in 
the examples below: 

RB3. ko karaba la moaDil] ki da3. 

Nom pangolin then AF-enter Acc ground 
'The pangolin then entered the ground.' 

RB4 . la kSla ko soLaw o-pala-palal] ki balSl] . 

then come Nom snake AF woo Acc name 

'Then the snake came to woo BalSl] . ' 
RB5. papacay ko saovalay sa ababay. 'The man killed a woman. ' 

kill Nom man Acc woman 

The difference between ki and sa in Budai is that the former 
is definite and the latter indefinite. 

In the Maga dialect of Rukai, there are the following case 
markers : 

ki Nominative, human noun, common or personal 
na Accusative, non-human, common noun 
- ana Accusative, personal 

ku marker leading a relative clause (indicated by [ ] ) 

RG6. u-stiti r)kua ki toto. 'Toto beat me.' 

AF-beat Acc/me Nom name 
RG7. u-stiti vlavlaki ki marDalji. 'The old man beat a child . ' 

AF-beat child Nom old 
RG8. u-kani DaDoIJ o na blibli. 'The monkey ate a banana.' 

AF-eat ir\onkey Acc banana 
RG9. kamdu maa ki vakau na mu-ini iOd-i na biki . 
die will Nom name Acc go his feed Acc pig 
'Vakau will die if he goes to feed a pig. ' 
RGIO. n-u-be maa pesu Dia vakav-ana ki pipeci . 
Fut give money Acc name Acc Nom name 

'Pipeci will give money to Vakau.' 
RGll. u-rgu musu ku [aci-Da [ku amua takihoku] ] ? 
AF-know Nom/you who 3inv went Taipei 

'Do you know who went to Taipei ?' 
RG12. ika-Da ku [p-ika ki ipulu alapi] ? 
where put Nom name stone 

'Where did Ipulu put a stone ?' 

Instead of making a distinction between a common noun and a 
personal noun as in Tanan and Tons (see below) , Maga distinguishes 
between human and non-human nouns. This may be considered a unique 
syntactic development in Maga among all Ruka dialects. 
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The Tona dialect of Rukai has the following case markers: 
ku Nominative, common 
ki Nominative, personal 
na Accusative, Locative 

RN13. ki-a-kana ku baiabaia na aDawana. 
PF eat Norn banana Acc monkey 
'The banana was eaten by a monkey.' 
RN14. w-a-kan? ki ?ipulu na vdldvdld . 
AF eat Norn name Acc banana 
'Ipulu ate a banana.' 
RN15. maga?aucu na valavalak3 ku maruDaI]9. 
scold Acc child Nom old 
'The old man scolded a child.' 
RN16. pa-ua na kay 1 'Put it here !' 

put Loc here 

All these case markers are mostly optional in the sentence. 
However, ku may not be deleted when the noun it modifies is 
topical ized, as in RN16, or when there is agreement between the 
predicate and the subject, as in RN17: 

RN17. ku cu-cumay kiakan9 na ikulaw. 

Nom bear PF-eat Acc leopard 
'The bear was eaten by a leopard. ' 
RN18. ?iakai-ni ku valisan9 ? 
exit its Nom wild pig 
'Where is the wild pig ?' 

Syntactically Mantauran differs from the other Rukai dialects 
in the following respects: 

1. It has no passive construction indicated by ki- like the other 
dialects. A passive meaning is indicated by an accusative 
pronominal marker on the verb (see examples below) . 

2. It has developed an object-verb agreement (see examples below), 
not found in any other Formosan language. The agreement occurs 
only between the verb and a human object or an object possessed 
by a human. 

3. It has few case markers. Hence the subject and object of the 
sentence are determined by the unmarked word order of VOS. The 
only case marker 2X 'definite nominative' may have derived from 
the demonstrative 5ona?i 'that'. 

4. Almost all of its personal pronouns are bound. 

RM19. okana wdldwdld mavoroko. 'The monkey ate a banana.' 

eat banana monkey 
RM20. okan-in3 v3l9v3lc) mavoroko. 
eat Acc/him banana monkey 
'The banana was eaten by a monkey.' 
RM21. okan-in3 v9l3vc)lc)-ini tamatama. 
eat Acc/him banana Gen/his father 
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'Father's banana was eaten by someone.' 
RM22. okana-I]-ia9 vaiavSia-li ocao. 

eat Perf Acc/me banana my person 
'My banana has been eaten by a person.' 
RM23. ?okolo6"iaa ?i pdldl\d. 'The ghost is afraid of me . ' 

fe^r Acc/me Nom ghost 
RM24. ?oponoho ka-li. 'I am not from the village of ?oponoho.' 

place not I 

Except for Mantauran, all Rukai dialects have free personal 
pronouns, which usually occur in the topic position. They all have 
bound forms for the nominative, accusative and genitive. See Li 
(1977: Appendix, 1994), and Zeitoun (1994) for the personal nouns in 
the five major Rukai dialects. 

2.7 Bunun 

There is no distinction between common noun and personal noun 
in Bunun. 

According to Ogawa and Asai (1935:588), Bunun had a contrast 
betveen the nominative marker as and the oblique marker is (in the 
Northern and Central dialects) or mas (in the Southern dialect). 
However, in the Isbukun dialect of Bunun, there is only one case 
marker mas that may precede an object, as in BSl below. The 
grammatical particle tu between two nouns indicates subordinate 
relationship; of. BS2a and b: 

BSla. m-aun ?utul] mas bunun. 'The monkey ate banana.' 

AF-eat monkey Acc banana 
b. kaun-un mas ?utul]-cia? bunbun. 
eat -PF Acc monkey that banana 
'The banana was eaten by that monkey.' 
BS2a. mataisah bukun mas cina?. 'Bukun dreamed of Mother.' 
AF-dream name Acc Mom 
b. mataisah bukun tu cina?. 
AF-dream name Mom 
'Some one dreamed of Bukun 's mother' 

In most Austronesian languages, case markers precede the noun 
they modify. Nevertheless, the nominative and accusative markers 
usually appear as suffixes to nouns in Bunun, as in B3S-4. The 
case-marking affixes may have derived from demonstratives. This can 
be considered a unique syntactic feature developed in the Isbukun 
dialect of Bunun. 

BS3. makua? cina?~a? si?al (mas) ?uva6-cia? ? 

how Mom Nom/that coax Acc child Acc/that 
'How does that mother coax that child ?' 
BS4. manah tama?-an ?aval-tan. 

shoot Dad Nom/this squirrel Acc/this 
'(This) father shot this flying squirrel here.' 
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In the Takbanuath (a Central) dialect of Bunun, as reported by 
Jeng (1977) , there are several case markers that precede nouns, 
including: 



a Nominative (Jeng 1977:185) 

ka Nominative (Jeng 1977:185) 

ki Accusative (Jeng 1977:262) 

i Locative or accusative (Jeng 1977:107) 

See Jeng 1977 for examples in sentences. 

There are four different sets of pronouns in Bunun: nomina- 
tive, accusative, genitive and locative. There are both long and 
short forms in the first three sets, whereas in the fourth set 
there are only long forms. See Li 1994 for Isbukun examples for all 
pronominal forms in sentences. 



Personal Pronouns in Bunun (Isbukun) 



Nominative 



Accusative 



Long 

Isg saikin 

2sg ka-su? 

3sg Cl saian 

Di saia? 

line ka-ta? 

lexc ka-imin 

2pl ka-mu? 

3pl naia? 



Short 

-ik 

-as 



-ta? 

-im 

-am 



Long 
Qaku? 
suu? 

saicin 

saicia? 
mita? 
Qami? 
muu? 

naicia? 



Short 
-ku? 
-su? 



-ta? 



-mu? 



Genetive 
Long 
?i-nak 
?i-su? 

?isaicia? 
?i-mita? 
?i-nam 
?i-inu? 

?inaicia? 



Short 
-nik 
-su? 

saicia? 
-mita? 
-nam 
-mu? 

naicia? 



Locative 

Qaku-an 
su?u-an 
sai-an 

mita-an 
Qami-an 
mu?u-an 
nai-an 



As in the other Formosan languages, the pronominal forms vary 
from dialect to dialect in Bunun. Phonologically , perhaps also 
syntactically, the northern dialects of Bunun are more conservative 
than the others. Given below for comparison is the pronominal 
system in Takituduh, a northern dialect of Bunun. 

Personal Pronouns in Bunun (Takituduh) 



Isg 

2sg 

3sg 

line 

lexc 

2pl 

3pl 



Nominative 
Long Short 



?aQak 
?asu 
cia 
?ata 
?aQam 
?amu 
nai 



-ak — ku 



-ta 



Accusative 
Long Short 
Qakun 
?asu 
cia 
?ita 
Qami 
mu?un 
nai 



-ta 



Genitive 
Long Short 
?inak -nak 
?isu?u -su?u 
?icia 

?imita -mita 
?inain -nam 
?imu -muu 
?inai 



Locative 

Qakuan 
su?uan 

?ita?an 

Qamian 

mu?uan 



BN5. ma-ludaq ?asu Qakun. 

AF hit Nom/you Acc/me 
BN6. ma-ludaq cia ?ita. 

Nom/he Acc/us 



' You (sg) hit me. ' 
'He hit us (inc) . ' 
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BN7 . caiv-i ?a6ak hutan. 

give PF/Imp Nom/I potato 
BN8. ?inak tu ?uva?a6. 

Gen/I child 
BN9. muhaan-ak su?uan. 
AF-come Nom/I Loc/you 

2.8 Paiwan 



'Give me sweet-potatoes ! ' 
'(He) is my child.' 
'I came to your place.' 



There is personal and non-personal distinction in Paiwan nouns 
(Ogawa and Asai 1935:137, Ho 1978). 



Common : 
Personal sg. 
Personal pi. 



Nominative 
a 

ti 

ti-a 



Genitive 
na, nua 
ni 

ni-a 



Oblique 
ta, tua 
tjay 
t jay-a 



ocative 



There are four different sets of personal pronouns: 

nominative, genitive, accusative and locative, and only the 

nominative and genitive have both long and short forms (Ogawa and 
Asai 1935:138, Ho 1978): 



Nominative 

Long 
Isg tiak3n 
2sg tisun 
3sg timadju 
line titjSn 
lexc tiam3n 
2pl timun 
3pl tiamadju 

2 . 9 Puyuma 



Short 
-ak3n 
-sun 

-it j3n 
-aman 
-mun 



Genitive 

Long 

niak3n 

nisun 

nimad ju 

nit j3n 

niamSn 

nimun 

niamadju 



Short 

ku- 

su- 

t ja- 
nia- 
nu- 



Accusative 

tjanuak3n 
t j anusun 
t jamadju 
t januit j3n 
t j anuam3n 
t janumun 
t jaiamadju 



Locative 

t j it januak3n 
t j it janusun 
t j imadju 

t j itjanuit j3n 
t j inuam3n 
t j it janumun 
t j iamad ju 



There are two major dialects of Puyuma, Nanwang and the 
others, including Katipul, Tamalakaw, Rikavong, etc. Nanwang is 
more conservative ir. retaining the voiced stops, which have all 
changed to fricatives in the other dialects. It is assumed that 
Nanwang may also be more conservative in syntax. In fact, both 
dialect groups have the same or similar case m; rkers for nouns; see 
Cauqelin (1991) for Nanwang and Tsuchida (1980) for Tamalakaw. 

Puyuma is a VOS type of language. It distinguishes between 
common noun and personal noun. It has four sets of case: 
nominative, genitive/agentive , oblique and locative. 



Common specific 

unspecif ic 
Personal singular 
plural 



Nominative Oblique Genitive/Agentive Locative 

na kana kana 

a Da Da i 

i kan 
na kana 



ERIC 



■282- 



93 



PI, 



P2 



P3a 



P4 



P5, 



P6 



P7 



'The baby cried. ' 
'I like children. 



'Mother returned home' 
'Mother and others returned home. ' 



maTal] is na manud3l] . 

cry Nom baby 
sagar ku Da walak. 
like Nom/ I Obi child 
. muruma? la i ina. 

return already sg Mom 
. muruma? la na ina. 
returen already pi Mom 
tu bSray-ay Da walu kan tina-taw iDina walak 
give RF Obi candy Pers sg Mom his this chi Id 
'This child was given some candy by his mother.' 
tu kan-aw kana walak iDina bu?il. 

eat PF Gen-Com-sp child this taro 
'This taro was eaten by a child.' 
na m3kan kana radis i: a walak. 
Nom-sp eat Obi peanut Nom-nonsp child 
'The one who ate peanuts was a child.' 
nanu ruma? i puyuma. (Cauqelin 1991) 
your house Loc 
'Your house is at Puyuma.' 



Personal Pronouns in Puyuma 



Nominative 

Long Short 

Isg kuyku -ku 

2sg yuyu -yu 

3 taytaw 

line tayta -ta 

lexc mimi -mi 

2pl muymu -mu 



Genitive 
Long Short 
naqku -li 
nanau 

nantu -taw 
nanata 
naniam 
nanmu 



Accusative 

kal] ku 

kanu 

kantaw 

kanta 

kaniam 

kanmu 



(Cauqelin 1991) 
Agenitive 



-ku 

-nu 

-tu 

-ta 

-niam 

-mu 



Given below are examples to illustrate some personal pronouns 
is sentences: 



'Have you(sg) eaten already ?' 
'You(pl) sawus(exc).' 
'He saw us (inc) . ' 



P8. m3kan yu la ? 

AF-eat Nom/you already 
P9 . m3na?u mu kaniam. 

see Nom/you (pi) Acc/us(exc) 
PIO. taytaw ra3na?u kanta. 

he see Acc/us(inc) 
Pll, s3LTa?-ay kuyku. 'I was slapped on the face by someone, 

slap RF Nom/ I 

2.10 Amis 

Amis distinguishes between common noun and personal noun. It 
has four different sets of case markers: nominative, genitive, 
accusative and locative. 

Amis has a variety of dialects. The most conservative dialect 
is Sakizaya, which retains more archaic phonological and lexical 
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features than the other dialects. Unfortunately I have no data for 
Its grammar. This study is based on the central dialect, which is 
most widely used among all the Amis. 



Common : 
Personal : 



Nominative 
ko 
ci 



Genitive 
no 
ni 



Accusative 
to 

ci. . . -an 



Locative 
i 



'The child cried at home. 
'The father hit a child. 



Ml. t-oiu-al] ic ko wawa i lumaq. 

AF- cry Nom child Loc house 
M2 . mi-palo ko mama to wawa. 

AF-hit Nom father Acc child 
M3. palo?-an no mama ko fol]oh no wawa. 
hit PF Gen father Nom head Gen child 

'Father hit the child's head.' 
M4. mi-cahiw ci raraq ci tafiq-an. 'Raraq persecuted Tefiq. ' 

AF-persecute Nom Acc 
M5. patay-an no ina ko fafoy to cahiw. 

kill PF Gen Mom Nom pig Acc hunger 

'The pig was starved to death by a mother.' 
M6. patay-an ni raraq ci tafiq. 'Tefiq was killed by Raraq. ' 

kill PF Gen Nom 

Regardless of the variant word order VSO~VOS in the surface 
structure, the Agent always precedes the Patient. 



Personal Pronouns in Amis 



Isg 

2sg 

3sg 

line 

lexc 

2pl 

3pl 



Nominative 

kako 

kiso 

cil] ra 

kita 

kami 

kamu 

cal] ra 



Genitive 
I 

mako 
miso 
nira 
mita 
niam 
namu 
nal] ra 



II 

-ako 
-iso 

-ita? 
-ami 
-amu 



Accusative 

takoanan 

tisoanan 

cil]ra?an(an) 

tita?anan 

tamianan 

tamoanan 

cal] ra?an (an) 



'My child was hit by 



me, 



M7. palo?-an-ako ko wawa?-ako. 

hit-PF Gen/ I Nom child Gen/ I 
M8. ci raraw ko mi-palo?-ay, ca?ay-ay kako. 
Pers name Nom hit NM not NM Nom/ I 
'It in Raraq who hit (him), not me.' 
M9. ma-tawa ko tamdaw takoanan. 'The person laughed at me. ' 

laugh Nom person Acc/me 

2.11 Pazeh, Kavalan, Thao and Saisiyat 

I (Li 1978) described the case-marking systems of the four 
less known Formosan languages (Pazeh, Kavalan, Thao and Saisiyat) 
m some details. Readers are referred to the paper for their case 
markers for nouns and pronouns and their examples in sentences. 
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Here I shall briefly cite only the case markers on nouns for 
convenience of a general comparison. 



Pazeh has the following four sets of case markers for noun: 
ki Nominative marker 

u Accusative marker, coordinate marker 
ni . nu Genitive markers 
di Locative marker 

Kavalan has the following case markers: 

a(wa,ya) Nominative for a common noun 

ti Nominative for personal 

tu Accusative for general and non-human 

ta Accusative for human 

na Genitive for common noun 

ni Genitive for personal 

ta-. . .an Locative 

Thao has the follov/ing case markers: 
na Nominative for a common noun 

ti Nominative for a personal including kinship terms 
sfa) Nominative or accusative for general 
tu Accusative 
?i Locative 

Saisiyat has the following case markers: 
ka Accusative 

noka Genitive for common noun 
ni Genitive for personal name 

no Genitive for nonpersonal proper noun, also Benef active 
ray Locative or directional for common noun 
kah Directional marker for personal name 



3. summary and Discussion 

Some Formosan languages distinguish between the common noun 
and the personal noun like the Philippine languages, whereas the 
others do not. Such a distinction is made in Atayal (Mayrinax) , 
Rukai (Tanan, Maga, Tona) , Paiwan, Puyuma, Amis, Kavalan, Thao and 
Saisiyat. The distinction is not made or found in Seediq, Tsou, 
Kanakanavu, Saaroa, Rukai(Budai, Mantauran) , Bunun or Pazeh. The 
distinction seems to have existed at an earlier stage but lost in 
some modern languages and dialects, including the Squliq dialect of 
Atayal, Seediq, the Budai and Mantauran dialects of Rukai. However, 
there is no evidence that such a distinction ever existed in the 
Proto-Tsouic language. The Tsouic la'iguages differ from most other 
Formosan languages, the Atayalic and the so-called "Paiwanic", in 
this respect. 
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The case markers in most of the languages have relatively 
simple systems. Only two of them have more Cilaborate systems, i.e. 
the Mayrinax dialect of Atayal and Puyuma. Have these two languages 
developed elaborate systems of case markers on their own, or have 
the others simplified their systems ? This requirtiss further study. 

All Formosan languages have fairly elaborate pronominal 
systems, and most have three or more different sets of pronouns: 
nominative, genitive, accusative and/or locative. The pronominal 
systems are in general more complicated than those of case markers 
for nouns in these languages. Moreover, most of the languages have 
developed the short forms of personal pronouns. Thus they have both 
long (free) and short (bound) forms for one or more sets. Only a 
few have not, including Saisiyat and Thao, and these languages are 
more conservative in this respect. The other extreme in the 
development of personal pronouns is that a language, such as the 
Mantauran dialect of Rukai, has only bound forms, and no free 
forms. Such a language has undergone one of the most drastic 
syntactic changes. By comparing the different case markers in 
Formosan languages, we can get a better picture of how they differ 
from each other in the evolution of their case-marking systems. 

List of Abbreviations 

Acc Accusative 
AF Agent-focus 
Agt Agent 

Aux Auxiliary ver>> ^. 

Ben Bp*" ' ' ^ ,^ 

or rJenef active-focus 

Dat Dative 

exc exclusive 

Gen Genitive 

gener general 

inc inclusive 

Ins Instrument 

inv invisible 

LF Locative-focus 

Loc Locative 

NM Nominalization 

Nom , Nominative 

Obi Oblique 

Perf Perfective 

Pers Personal 

PF Patient-focus 

PI Plural 

R Reduplication 

RF Referential-focus 

Sg singular 

spec specific 

Top Topic 

Vi intransive 

Vt transitive 
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AFTER BEING REFUSED: 
RESPONSE TO FACE-THREATENING SPEECH ACTS 



Chao-chih Liao 
Feng Cilia University, Taichung Taiwan 

Abstract 

This is a sociolinguistic study, researching how Chinese people in Taiwan react or 
redress in Mandarin Chinese to save face (for 'self and/or for 'other') after they are 
refused. The post-refusal reaction includes the reaction after the refuser's rejection to 
requests, invitations, offers, and suggestions. Twenty-three strategies are concluded in 
the post-refusal speech acts. The twenty-three first-order strategies are further 
summarized in five second-order super-strategies or maxims. The highest-order 
principle in the speech act is politeness and/or face-saving theory. 

The strategies of the post-refusal speech acts include 1) silence, 2) changing the 
topic, 3) counter-rationale, trying to persuade the refuser, for example, promising early 
retribution, 4) complying with the refuser, 5) external yes, internal no, 6) saying thank 
you 7) criticism, 8) threat, 9) reprimand, 10) offering self an alternative, 11) modifying 
one's request, for example, reducing the amount of borrowing, 12) asking the refuser 
to find an alternative, 13) explanation of the justification of the request, 14) 
postponement, .5) giving a lesson, 16) repeating the request, 17) comforting the 
refuser and/or self, 18) showing surprise, 19) asking for explanation, 20) showing 
disappointment, and/or regret, 21) a composite strategy, 22) topic-closing expressions, 
and 23) joking. 

The methods used in summarizing post-refusal speech acts into twenty-three 
strategies include the intuition of the researcher as a native Mandarin speaker in 
Taiwan, participant observations, and role playing by undergraduates to fill in the 12 
DCT items devised by Takahashi et al. (1986), which were modified and translated into 
Mandarin Chinese by the researcher. Takahashi et al. originally devised the 12 DCT 
items for subjects to fill in words of rejection with the known parts of the requester's 
request and the requester's post-refusal expressions. My subjects are given Takahashi et 
al. s twelve situations and the requester's request. They are asked to make up the 
conversation in which the interactant refuses and the following post-refusal speech until 
the dialogue comes to a natural end. 

The factors infiuencing post-refusal speech acts include the property of the request 
(or rank of imposition in Brown and Levinson's (1987) term), the interactants' relative 
social status, the familiarity or social distance of the interactants involved, the sex of 
the interactants, the way one is refused, the age of the interactants and each individual's 
speech habits. For example, if the refuser is of higher status, the refusee may simply 
say THANK YOU after being refused. If the refuser is a close friend, the refusee may 
react by reprimanding, giving a lesson to the refuser, complaining or appealing to 
feelings, threat, and so on. The post-refusal speech acts are also affected by the way 
one is refused. For instance, if the refuser refuses by offering an alternative, the 
refusee may respond with the explanation of the impossibility of the alternative, or 
saying s/he will try to follow the refuser's suggestion of the alternative. 

Besides the revised data from Takahshi et al. to be used for this study, we also 
devised six role-play situations for students to fill in the words they would say after 
they are refused. For most situations, there are 100 subjects, 50 males and 50 females, 
participating in the tasks. There are 596 subjects participating in the study. After 
collecting the data, three statistical methods-factor analysis. Student's t-test, and the 
Chi-square test for homogeneiiy-are used to analyze Ihern. 

The factor analysis and Student's t-test are for testing whether males and females 
belong to the same culture within the macro-Chinese culture. The statistical methods 
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prove they are from different micro-cultures, which seems to explain why females are 
different from males in reaction to refusal. The Chi-square test is for testmg the 
homogeneity of two sexes in the strategy use and whether thev (both males and 
females) use the same strategy to refuse the family member, a close friend, and an 
acquaintance. 

1. Introduction 

Gass and Houck (1993: 1) point out that 'refusals are a highly complex act 
primarily because they often involve lengthy negotiations to accommodate the 
noncompliant nature of the speech act. Because refusals normally function as 
responses, they preclude extensive planning on the part of the refuser. That is, they are 
played out events, rather than instances characterized by a brief exchange or single 
utterance.... A refusal... without some kind of elaboration or follow-up can have severe 
social consequences.' 

Why are refusal-sequences complex? Primarily because they are related to face. 
'Face is something that can be lost, maintained, or enhanced, and any threat to face 
must be continually monitored during an interaction. And, since face is so vulnerable, 
and since most participants will defend their face if threatened, the assumption is made 
that it is generally in everyone's best interest to maintain self's and each other's face 
and to act in such ways that otherj: are made aware that this is one's intention (Fraser, 
1990: 229).' What makes acceptance-refusal-sequences more complex is that Chinese 
people in Taiwan may say 'no' to mean 'yes', or vice versa. Sometimes after the 
acceptance-refusal negotiations, the inviter or the requester might still be at a loss as to 
w'lat the invitee or the requestee's real intention is (Lii-Shih, 1994). 

In Liao (1994), I talked about refusal phenomena in Mandarin Chinese. In this 
short article, I will discuss how the requester (s/he is also the refusee) and the requestee 
(also the refuser) redress or defend their own face in the post-refusal phase. Since 
refusal events are FTAs (face-threatening acts), after FTAs there, should be some re- 
dressive speech acts to compensate the face lost for the requester and/or the refuser. In 
Mandarin Chinese we call it xia-tai-jie ('stepping-down the stage'). 

The purposes of the study are three: to summarize post-refusal speech acts into 23 
strategies based on relevant Mandarin Chinese data, to propose the politeness maxims 
of the post-refusal speech act strategies, and to compare the females and males in the 
strategy use of post-refusal speech act. Of course, the examples in Mandarin Chinese 
offer MFL/MSL learners (learners of Mandarin as a Foreign/Second Language) the 
exact words native speakers use as refusing and post-refusal expressions'. 

2. Literature Review 

Liao (1994) indicates there are two kinds of request: one is the request to help, 
offer, suggest, and invite in favor of the addressee (an FSA request, a face-satisfying 
act)-; the other kind is the request of the addressee's time and money in favor of the 
requester (an FTA request-the narrow meaning of request). The pre-requisite of the 
FSA request is that if the requester does not request, the responder does not ask for the 
requester's help, offering, suggestion, or invitation; and the pre-requisite of the FTA 
request is that if the requester does not request, the responder would not do it. For 
addressee-beneficial FSA-requests, the responder, especially in Oriental countries, such 
as Taiwan and Japan, must learn the art of acceptance by first refusing. After s/he finds 
that the requester is really sincere, s/he accepts but asks the FSA givers not to spend a 
lot of time or money on it. For FTA-requests, one also has to learn the art of refusing 
as well as the art of the post-refusal speech act because face is a universal concept to be 
maintained by all human beings. 

Making a request may be an FTA, rejecting a request is surely an FTA. (Lii- 
shih, 1986; Brown and Levinson, 1987; Liao, 1994). Making a request, in its narrow 
meaning, violates H's 'face want' of being unimpeded and rejecting a request violates 
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H's face want of being accepted. A second difference between the two kinds of FTAs 
is that to make a request, the requester can prepare how to utter it for a long time, 
consulting others, writing a draft on what to say to reduce t!ie effect of an FTA. S/he 
can prepare what and how to respond if being refused or accepted. However, refusal is 
a response to S's request, the refuser must act in an impromptu way. The impromptu 
way of refusing is also greatly affected by the personality of the refuser and the way the 
request is made. The post-refusal speech act may be an FTA or an FSA. If the refusee 
complied with the refuser immediately, it is an FSA for the refuser. If the refusee 
offers counter-rationale, asks the refuser to find an alternative, asks for explanation, 
threatens, etc. then, it is an FTA for the refuser. The post-refusal speech act on the part 
of the requestee (also the refuser) can seldom be prepared. 

In this article, I deal with FSA and FTA request together. In later studies, I will 
deal with them— offer, suggestion, invitation, request, etc.— individually. 

Women's language has been speculated to be different from men's (Jespersen, 
1922; Lakoff, 1975). It is generally agreed that boys and girls have been treated and 
interpreted differently shortly after they were born, for example, infant's cry is inter- 
preted as anger when listeners are told the baby is a boy, and as fear when they are told 
It is a §irl (Condry and Condry, 1976). It is generally agreed that boys and girls grow 
up mauily in interaction with people of the same sex; therefore, boys and girls 
gradually develop their awn way of speaking. When women speak, they intend to 
create rapport and when men speak they intend to assert or resist control and the speech 
strategies may thus vary by sex (Kramarae, 1981). Different micro-cultures of boys 
and girls result in different speech strategies and purposes. In this study, we will prove 
that men and women are different in post-refusal speech strategies. 

2.1 Language Specific vs Language Universal 

Little study has been done concerning the post-refusal speech act, with the 
exception of indirect data used in refusal studies, such as Takahashi et al. (1986) and 
Liao (1994), in which DCT items of the request and the post-refusal speech are given 
to the subjects to fill in the refusal words. One of Takahashi et al.'s findings about the 
difference between Japanese and American ways of refusal is that the Japanese are 
more status sensitive than the American-this is a well-known fact about the Western 
and Eastern societies. The Japanese seems to use expressions of regret more frequently 
with higher-status interlocutors, but less frequently with lower-status interlocutors. In 
this post-refusal speech act study, I would like to contend that in Taiwan, people use 
more of the strategy of 'saying thank you' with the higher-status refuser than with the 
lower-status refuser. The comparative study of four cultures— American, Taiwanese, 
Singaporean, and Japanese-in four areas concerning post-refusal speech act is under 
way by Bresnahan, Liao, Kuo and Takashima. 

3. The Study 

Based on the researcher's intuition as a native speaker of Mandarin Chinese in 
Taiwan, participant observations, and role-played conversations written by adult native 
speakers, twenty-three post-refusal speech act strategies are firstly obtained (Section 
3.1)3. 

Then, two quantifiable sub-studies were devised. 

Sub-study I is the translation into Mandarin Chinese and modification of 
Takahashi et al's 12 DCT items-so that only the first utterances of the requester's 
request, offer, invitation, and suggestion are left-for the native Mandarin speakers in 
Taiwan to make up the conversation in which the interactant refuses and the following 
post-refusal conversation until the dialogue comes to a natural end. Takahashi et al.'s 
original items are for subjects to fill in words of rejection with the known parts of the 
requester's request and the requosler's post-refusal reaction. My subjects were only 
given Takahashi et al.'s twelve situations and the requester's request without the 
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requester's post-refusal reaction. 

To make things easier, unlike Sub-study I, Sub-study II concentrates on FTA 
requests only. It is divided into three parts. Tne first part is for subjects to fill in what 
they say in refusing a request, which will be discussed in the joint publication of 
Bresnahan, Liao, Kuo, and Takashinia. The second part consists of eleven questions 
for males and females to make judgement about their own features, which form the 
basis of the argument that males and females come from different micro-cultures within 
the same Chinese macro-culture (see Section 5). The third part is the questionnaire for 
filling out post-refusal speech after being refused of six FTA requests, which will be 
discussed in Section 5.1. 

3.1 The twenty-three post-refusal speech act strategies 

Before talking about the twenty-three post-refusal speech act strategies, one thing 
must be made clear. There are many strategies which are similar, for example. 
Strategy 3 'Counter-rationale', Strategy 11 'Modifying one's request'. Strategy 13 
'Explanation of the justification of the request', and Strategy 16 'Repeating one's 
request'. To make the difference between Strategy 3 and Strategy 16, we call it 
Strategy 16 if the refusee does not present more reasons, but only add the word zhetide 
'really' (For example, if being refi ■ d of salary raising, the worker repeats the request 
by saying, "Dashi wo zhende xuyao iiaxin (But I really need add-salary) 'But I really 
need the raise of the salary.',") to the originally request. If the refusee offers more 
reasons and refutes the explanation of the refuser, then it is Strategy 3. If the refusee 
tries to give objective facts to justify the request, it is Strategy 13. If the content of the 
request is changed, then it is Strategy 11. 

The strategy of reprimand and that of giving the refuser a lesson may be 
ambiguous too. If the high-status refusee gives a lesson to the low-status refuser, the 
latter m<w treat it as reprimand, and the high-status treats it a lesson. 

1 . Silence and/or going away 

As a participant observer, I grasped the following request-refusing negotiation in 
the library. 

(At the copyer in the library at 1:45) 
Student A,: Qing wen ni hai yao yin ji ye? 

Please ask you still want copy how-many pages? 

'How many pages do you still have to copy?' 
Student B,: San-shi dwo ye. 

thirty more pages. 

'Over thirty pages.' 
Student A^: Rang wo xian yin hao-ma? Wo zhi yao yin 

san ye. 

Let me first copy OK PRT? I only want copy 
three pages. 

'Can you let me copy first, because I have only three 
pages to copy. ' 
Student B^: Wo zai gan shi-jian. Wo liang-dian you 
shi. 

I am hurry time. I two-o'clock have things. 
'I am in a hurry; I must do something else at two 
o'clock. ' 

Student A.,: Ke-shi wo zhi-you san-ye, er-qie wo yi-jing zhao 
hao le. 

But I only have three pages, besides I already find 
OK PRT. 

'But I only have three pages to copy; besides I have 
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found the pages to copy. ' 
Student By (Keep silent, neglecting A's request, and continue 
copying) 

In the above conversation. A, is a pre-request question, in which A tries to justify 
her requesf*. A^ is the request followed by B^, in which B explains why she cannot 
agree to A's request. Then A3 is the use of "the strategy of 'repeating the request' 
(Strategy 16 in this study) in another expression. B3 uses the strategy of 'silence' as 
post-refusal speecii act which confirms her intention as refusal. 

Afte • being refused by a high-status individual, if the refuser uses the strategy of 
topic-closing expression, for example, he says, "Xiaban hou zai taolun ruhe jiejue, xian 
hwei ganwei ba\ (Off-duty after gain discuss how solve, first return position PRT!) 
'Let's talk about it after duty. Now you 9-. back to your job.'," the refusee will have 
no choice but keep sil^it and leave. 

It is also possible that after the high-status refuser explains the reason for refusal, 
then the requestee goes away immediately to avoid more counter-rationale from the 
low-status. 

2. Changing the topic 

In one of the examples mentioned above, ..*^ter refusing the worker's request for 
salary raising by a postponement strategy and the worker's insistence on knowing the 
answer right away, the boss confirms his refusal intention by saying, 'Let's talk about 
it after duty. Now you go back to your job," which is a composite strategy of first 
postponement and then changing the topic. 

After being refused by the low-status worker to work overtime with the strategy 
of explanation that he must take his child to see a doctor after work, the boss changes 
his topic by saying, "Bing de yan-bu-yan-zhongl (Sick PRT serious-not-serious?) 'Is 
your child seriously sick?'," to show his concern of the worker's family, which also 
implies that the boss has co'nplied with the refuser— the worker. 

It seems that if the refuser uses the strategy of explanation which is conventionally 
worth congratulating or consoling, the refusee will surely change the topic to 
congratulate or console the refuser, which also implies that the refusee has complied 
with the refuser. Another example given by one informant is that on hearing the 
refuser say that he has to rush back home because his wife has had a baby, the refusee 
first shows his surprise and then changes the topic: Zhen buqiaol Gongxi ni dang baba 
le. (Really not coincidence! Congratulations you as father PRT.) 'What a coincidence! 
Congratulations on your being a father.' 

3. Counter-rationale, trying to persuade the refuser 

In one realistic conversation made up by an informant for the request of a low- 
status worker for salary raising which was refused by the high-status boss, the boss 
used the strategy of postponement, "I am very busy now. Let's talk about it later." As 
the post-refusal reaction, the worker says, "Wo xianzai jiou yao zhidao (I now want 
know) 'I want to know the answer now," which is a counter-rationale. Then the boss 
insisted on talking about it later by saying, "Wo shwo deng yi-xia. Wo xianzai yao qu 
song shu le. Hao nao kan dian (I say wait a moment. I now want to send books. Good 
take-care-of store) 'I said, "Let's talk about it later," 1 must send the books out. You, 
take good care of the store.'." 

When refused by the status-equal to lend classnotes because the refuser also has to 
prepare for the test, tiie refusee often uses the strategy to persuade the refuser, "Xianzai 
xian jie wo yixia nta! Yingyin h\\>ei-lai jiou Iman ni (Now first lend me one-minute 
PRT! Copy come back then return you.) 'Lend me for just one minute. I will return 
you after copying it.'." The copying machine is so popular that the refusee often utters 
so as counter-rationale for similar cases. 

4. Complying with the refuser 

After being refused by the high-status boss against salary-raising by a strategy of 
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offering an alternative of a good year-end bonus, the worker says, "Xiwan niandi neng 
gankwai dao (Hope year-end can quickly come) "I hope it will soon be the end of the 
year.'," which is the compliance with the refuser. 

When the high-status boss wanted to invite the top-managers and their 
wives, most informants have managers tell the boss the specific reasons for 
being unable to accept the acceptance. Owing to the adequacy of reasons, most 
informants make the boss comply with the refuser (refer to Table 2). If the reasons 
given are vague, most informants make the boss ask for explanation (Strategy 19). 

5. External yes, internal no (Unwilling compliance) 

One informant, in giving the conversation for the request of salary raising, made 
the worker say, " Hao bal Na ye zhiyou zheyang (OK PRT. Then too only thus.) 'OK, 
it's the only way.'," in response to the high-status boss's postponement strategy, 
"....When the profit of the couipany has returned to its original rate, I will make 
adjustment on your salary." Then the informant put in the parentheses that the worker 
does not think the boss will be willing to raise his salary, which shows that the 
compliance with the refuser is only on the face value of the words. Strategy 4 and 
Strategy 5 are different in the speaker's mind only; they are the same on the surface 
value of the speech. 

6. Saying 'thank you' 

In response to the explanation strategy of the boss for the impossibility of salary 
raising for the time being and his promise of future conformity, one informant made the 
worker say, "Xiexie ni, laoban. Wo zhidao ni juedwei buhwei kweidai wo (Thank vou, 
boss. I know you absolutely not maltreat me.) 'Thank you, boss. I know you will not 
maltreat me.'," the THANK YOU of which means 'complying with the refuser'^ 

After being refused by the status-equal classmate to lend his classnotes by giving 
the requester a lesson, "I know you are always busy. I have helped you many times. 
However, this time, I think that you cannot enjoy the crop which you have not planted, 
Ni buke laoshi zwo-xiang-qi-cheng (You cannot always sit enjoy its accomplishment)," 
the refusee used the composite strategy of first showing regret, then saying 'I'll be 
grateful to you' and then repeating the request: "Wo xiaci buhwei le. Wo yihou hwei 
hao hao xie ni de. Keshi mingtian jiouyao kaoshi le ye! (I next time not-can PRT. I 
later will good good thank you PRT. But tomorrow will-be test PRT PRT.) 'I won't 
(do the same thing) next time. I will be grateful to you. But it will be test tomorrow. 
Please.'" 

When the host refuses the cleaning lady's offer of compensating for the broken 
vase, by the strategy of a philosophical statement of swei swei ping-an (smash smash 
safe) 'May you have a sare and happy year' and then say specifically that it is not 
necessary to reimburse, the cleaning lady may say, "Xie-xie ni, lao-ban (Thank you, 
boss) 'Thank you, sir.'," and followed by a promise of working harder and more 
carefully in the future. 

7. Criticism 

This rarely happens; only three out of the total of 576 request-refusal-postrefusal 
sequences used the strategy in Sub-study I, the post-refusal utterances to the twelve 
situations. However, as a participant observer, I obtained the post-refusal speech act of 
an ill-mannered saleslady who used the strategy of criticism after she failed in several 
attempts to sell clothing to the same lady, when the customer is about to leave, by 
saying, "(Nide) shencai name cha, hai name tiao ((Your) figure so bad, still so 
selective) 'Even you, whose figure is so bad, are so stringent in selection.'," which is 
very a rude manners toward and will intimidate customers. 

8. Threat 

In Takahashi et al.'s article, after being refused by the high-status boss against the 
raise of salary, the low-status worker is made to use the strategy of threat, Then I 
guess I'll have to look for another job. For the same situation, one informant made the 
worker end tiie conversation by saying, "Hao bal Jiran zheyang, jiou swan le. Yihou 
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nijiou zhidao you sheme liougwo (OK PRT! Since so, then over PRT. Later you then 
know have what consequence) 'Since it is so, then 1 won't talk about it any longer. 
You will know what the consequence is later.'," which is a threat without being uttered 
specifically. The former part of this post-refusal speech act is a topic-closing 
expression, and the latter a threat. Therefore, stricily speaking, it is a composite 
strategy (Strategy 21). 

9. Reprimand 

When being refused by a low-status worker to work overtime for 1 or 2 hours, the 
boss used the strategy by commenting, "Ni zongshi dwei jiaban you ruci dwo de buyan 
yu qiaohe. (You always to overtime have so many POSS reluctance and coincidence) 
'You always have other things to do and are always reluctant to work overtime.'" 

For the same situation, when the low-status worker explained that he has other 
things to do so that he cannot work overtime, another of our informants made the boss 
use the strategy of reprimand by saving, "Sheme shi? Twei diao ta. (What thing? 
Cancel it) 'What thing is so important? Cancel it." In the boss's idea workers must put 
the company jobs as the first priority; he cannot tolerate this kind of excuse for not 
being able to work overtime. 

10. Offering self an alternative 

After being refused by a high-status boss with the strategy of explanation of the 
present low profit of the company, one informant has the worker give himself an 
alternative, "Jiran ruci, wo zhihao lingwai xiang banfa le (Since so, I only another 
think method PRT) 'Since things are like this, I can only find some other methods (to 
earn more money).'," the alternative of which is very vague. 
11. Modifying one's request 

When being refused by a low-status worker to work overtime for 1 or 2 hours 
(when a boss says 1 or 2 hours, most people in Taiwan think it should be almost 2 
hours or more than 2 hours), one boss modifies his request by saying, "Buran jia \i 
xiaoshijiou hao. (If not, add one-hour then OK) 'How about working overtime for only 
1 hour?'" 

After a refusal to attend a party at a status-equal's house by the excuse that the 
refuser will meet a friend that evening, the host may modify his invitation, "Yao nide 
peng-you yicfi lai ina\ (Invite your friend together come PRT) 'How about inviting your 
friends to come, too?'" 

12. Asking the refuser to find an alternative 

When the boss gives the low-status worker the suggestion of writing reminders on 
a small piece of paper and is refused by the latter by the strategy of explanation that the 
worker tried it without success, the boss uses the strategy by saying, ^'Name xiwang ni 
nenggou xunzhao yige sliihe ziji de fangfa (Then hope you can find an appropriate self 
POSS method) 'Then I hope you can find a method suitable for yourself," in which 
the refusee asks the refuser to find an alternative for the refuser himself, not for the 
refused, because suggestion is supposed to be beneficial to the addressee. 

Another example, an FTA request-beneficial to the requester, is shown as 
follows, which is a realistic conversation I got from participant observation. Similar 
kinds of conversation can often be heard, but are not made up by the native language 
informants for the 12 DCT items mentioned as Sub-study I of this study. 

Teacher: Yitaii, ni ba zhexie ziliao tongji chulai. 

Yifan, you BA these data calculate out-come. 
'Yifan, I want you to use your statistics knowledge to make the data 
(the teacher's data) meaningful.' 

Student: Laoshi, wo zheng mang zhe xie lunwen, mingnian zai zwo, hao ma? 

Teacher, I now busy CONT-PRT write dissertation, next year again do, 
OKQ-PRT? ^ 
'Ma'am, I am busy with my dissertation. Can I do it for you next 
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year?' 

Teacher: Na ni tweijian yige lai bang wo zwo. 

Then you recommend one come help me do. 

'Then, you recommend one person to help me with the statistics.' 

In this conversation, the refusee (the teacher) asks the refuser to find an alternative for 
her. 

13. Explanation of the justification of the request/suggestion 

After being refused by a high-status boss with the strategy of explanation of the 
poor profit for the whole company plus the strategy of postponement, a worker may 
respond by saying, "Wo yi-jia dwo kao zhe-fen xin-shwei gwo hwo, ni jiou gei wo jia 
xin ba (My whole-family all depend-on this salary to live, you therefore give me more 
salary PRT) 'All my family depend on the salary; please give me raise of salary.'," 
which is an explanation of the justification of the request without refuting the boss' 
reasoning. 

After being refused by a status-equal classmate with the strategy of explanation 
that his classnotes have just been borrowed if the requester came earlier, he would be 
able to borrow it, the refusee used this strategy by saying, "Name nawei tongxue heshi 
hwei han ni? Wo jixu yong ta lai jiou ming. (Then that classmate when will return you? 
I urgently-need use it come save life) 'When will the borrower return you the 
classnotes? I desperately need it to save me.'" The latter part of the refusee speech is 
the explanation of the justification of the request. 

14. Postponement 

After being refused by the boss, the boss with the same composite strategy of 
explanation (of the poor profit for the company) plus postponement (promise of salary- 
raising in the future) as in a previous paragraph, a worker may also explain the 
necessity of the immediate raise of salary plus asking the boss to re-consider about it 
and to postpone giving' a definite reply, "Keshi lao-ban, wo zheiide xuyao dwo yi-dian 
de xin-sliwei, jiali feicfiang xuyao, ke fo qing lao-ban zai kaolu yifan? (But Boss, I 
really need more a-little POSS salary, home very need, whether or not please Boss 
again consider?) 'But Sir, I really need a little more salary because of the family 
condition. May I ask you to re-consider it?'." 

Besides asking for the postponement in answer as shown in the last paragraph, 
here is another example: After being refused by a status equal to lend his classnotes by 
a strategy of a lie, saying, "Keshi wode biji hai mei zhengii hao (But my classnotes yet 
not organized well) 'But I have not made the classnotes well organized.'," the refusee 
used the strategy of postponement, asking the refuser to lend it to him when it is well 
written." 

15. Giving a lesson 

After being refused to take his advice of keeping reminders on a small piece of 
paper by the low-status worker with the strategy of explanation, "Xie zhitiao de fangshi 
shi hen buz^o. Danshi wo lian xiao zhitiao dwo hwei Iwan diou, soyi you xie gen mei- 
xie yiyang. (Writing paper-slip POSS method is very not-bad. But I even small paper- 
slip all can disorder drop, therefore have write with not-write the-same.) 'It is a good 
idea. But I even cannot find the reminding slip. Therefore, writing or not writing are 
the same.'," the boss gives the low-status refuser a lesson, "Na ni shizai shi tai lan-san 
le. Yiao hao hao gaijiii nide gexing. (Then you really is too lazy PRT. Should good 
good improve your personality.) 'You are really very lazy. I hope that you can change 
the bad habit.'" 

When the househost refuses the offer of the cleaning lady to compensate for the 
broken vase, the cleaning lady may show her gratitude to the broad-mindness of the 
host and then give herself a lesson by saying, "Wo yi-hou hwei xiao-xin gong-zwo. (I 
later will carefully work) 'I will work more carefully.'" 

16. Repealing the request 
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After the high-status boss refuses to raise the salary by the strategy of 
postponement, the worker may repeat the request by saying, "Daslii wo zhende xuyao 
jiaxin (But I really need add-salary) 'But I really need the raise of the salary.'," which 
is the repetition of the request without giving more reasons. Repetition of a request 
stands for emphasis of the original message, not a null. 

17. Comforting the refuser ar.o/or self 

When the refuser refused to lend a motorcycle for the refusee to go out on a 
holiday, the refusee may use the strategy to comfort the refuser, "Mei gwatuil Wo ling 
xiang ban/a (Not matter! I another think method.) 'It does not matter! I will think of an 
alternative.'," which is to ease the uneasiness of the refuser because doing the FTA of 
rejecting may not only cause uncomfortable feelings in the refusee, but also cause the 
same feelings in the refuser, too. 

18. Showing surprise 

After being refused by a status-equal classmate by the explanation that she fell 
asleep in the class so that only the beginning of the professor's lecture is on the 
classnotes and she herself was nervous about the test at the moment, "I am sorry I don't 
have classnotes to lend you. I myself is thinking of borrowing them," the refusee 
shows great surprise, "Zenie keneng? Biji kongzhu juran yehwei zheyang] (How 
possible? Classnote princess even also-can so) 'How is it possible. You are called 
Princess Classnotes. Even you did not keep them!'" 

If a person (S) often does favor to another (A), S will feel surprised, if not show 
it, if A refuses him/her when being request to do S a favor of service. Then S will feel 
that s/he has befriended a wrong person. 

19. Asking for explanation 

After being refused by the high-status boss (to raise the salary) by the strategy of 
explanation that the worker has not reached the standard of salary raise, the worker 
used the strategy of 'asking for explanation' by saying, "Na wo yao zeme zwo, cai keyi 
fuhe nijicLxin de biaozhwun ne (Then, I must how do, then may conform you add-salary 
POSS standard Q-PRT) 'Then what is your standard of salary raising?'." 

For the same situation of asking for salary raise, an informant made the worker 
use the strategy of asking for explanation to be more specific, "Na shi sheme shihou 
ne? Xia-ge-yue shi ma? (Then is what time Q-PRT? Next-one-month, is Q-PRT?) 
'When will it be, next month?'," after the boss uses the stategy of postponement by 
saying, "Yes, I know that you have worked very hard. There will be some reward for 
the hard-working people." 

In any situation of request, invitation, offer, suggestion, if the refuser does not 
offer convincing reaspns, foe example, a vague excuse or a direct NO without any 
reason, it is possible tor the refusee to ask for explanation. 

20. Showing disappointment and/or regret, or complaining 

After being refused by the boss by using the strategy of explaining that there is the 
global depression and his salary is high already, the worker complains, "Meici ni dwo 
zheyang jiang (Every-time you all so say) 'Your excuse is always the same.'," to show 



After being refused by the status equal classmate by giving him a lesson, "You 
always act in this way. It seems a little unfair that you borrow other people's 
classnotes, which is the result of hard work, in the name of a test. While other people 
are working hard, you cut class and have a good time," a classmate may use a 
composite strategy of showing regret and then repeat the request: "Hao la! Wo zhidao 
wo zwo le. Ni jiou zaijie wo yici mat Bai [wo la (OK PRT! I know I wrong PRT. You 
then again lend me one-time PRT! Please PRT.) 'I am sorry. I know I am wrong. But 
please lend it to me once again.'." 
21. A composite strategy 

It goes witliout saying that many refusees used the composite strategy as the post- 
re I'u sal speech act. 
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After the boss was refused by the low-status manager to attend the party at the 
boss's house by the strategy of explanation that that day happens to be his daughter s 
fwentiethTirtSay the whSL f^unify haye planned for t^e brrthday, so that he is sorry 
to be unable to attend the boss's party, one informant made the ^oss utter Z^en^^. . 
Natian wo hwei zhunbei yige sheng-ri liwu gei m nur. zhwu ta sheng-ri kwaile (Rea Hy / 
That day I will prepare a birthday present give your daughter wish her birthday 
happy) 'Really!? I will prepare a birthday present for her and wish her a happy 
birthday ' " in which the boss first shows his surprise and then changes the topic, and 
of course complies with the refuser. This is quite typical of a composite strategy. 

22. Topic-closing expressions . u„ j 

After the high-status boss refuses to raise the salary by explaining the bad 
economic situation of the company, the worker may say, "Jiran nici, name yihou zai 
tan hao le (Since so, then later again talk OK PRT) 'Since the situation is so, let s talk 
about it later.'," which is a postponement strategy and also a topic-closing expression 

After being refused by the high-status customer to eat at a restaurant and sign the 
contract by the explanation that the company has decided to buy the goods from another 
supplying company, the salesman used the strategy of topic-closing expressions: Hao, 
nabuda-rao le. (OK, then not bother PRT) 'OK, tlien I won't bother you now. 

In'respolfse to a status-equal refuser against the offer of one more cake one 
informant had the refusee use the strategy of joke. The exact content of the 
conversation made up is as follows. 

Friend A: Zai lai yi-kwai dan-gao ba! 

Again come one-piece cake PRT! 

'How about one more cake?' 
Friend B: Ni xiang rang wo cheiig si ma? Bu yong le. Xie-xie. 

You want make me full dead Q-PRT? Not want PRT. Thank you. 

•Do you want to make me full to death? No, 1 don't want one more. 

Thank you.' 
Friend A: She-me ma! Wo shi xiang fei si ni. 

What PRT? I am think fat death you. ... 

'What excuse it is! My intention is to make you fat to deatn. 

3.2 Sub-study I , . , ■ a ■ . e 

Takahashi et al. (1986) give 12 DCT items, which are categorized into four 
categories-requests, invitations, offers, and suggestions-to informants of 80 students, 
who are Anierican native English speakers, Japanese native speakers. Japanese 
speaking English as a second language and Japanese speaking English as a foreign 
language Each category has three situations: a refusal to a higher status person one to 
a lower status person, and one to a status equal. The DCT items are ^iven in the form 
of request, then a blank for the informants to fill in-which must be refusal to the 
request, and then the given requester's response. The 12 DCTs will be reviewed by the 
category here. 

Request: , , r u • u 

To high-status refuser: Then I guess I'll have to look for another job. 
To low-status refuser: That's too bad. I was hoping you could stay. 
To status-equal refuser: OK, then I guess I'll have to ask somebody else. 

In response to a high-status refuser against request of salary raise, the worker says, 
"Then I guess I'll have to look for another job," which is a threat strategy^ In response 
to a low-status refuser against request of overtime jobs, the boss says. That s too bad 
I was hoping you could stay," which is the strategy of expressing disappointment and 
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regret. In response to a status-equal refuser against request of borrowing lecture notes 
the classniate says, "OK, then I guess I'll have to ask somebody else," which is the 
strategy of offering self an alternative. 

Invitation: 

To high-status refuser: Perhaps another time. 

To low-status refuser: That's too bad. I was hoping everyone would be there 
To status equal: OK, maybe another time. 

In response to a high-status refuser against invitation to an expensive restaurant and 
Sign the contract, the salesman says, "Perhaps another time," which is a strategy of 
postponement. In response to a low-status refuser against invitation, the boss says 

lhat s too bad. I was hoping everyone would be there," which was the strategy of 
expressing regret In response to a status-equal refuser against refusal, the friend says 

UK, maybe another time." which is also a strategy of postponement. 

Suggestion: 

To high-status refuser: OK. it was only a suggestion. 
To low-status refuser: Well, it's ai, idea anyway. 
To status-equal refuser: You should try it anyway. 

In response to a high-status refuser against suggestion, the student says, "OK it was 
only a suggestion,' which is a topic ending utterance. In response to a low-status 
refuser against suggestion, the boss says, "Well, it's an idea anyway," which is also a 
topic-closing utterance to end the conversation. In response to a status-equal refuser 
against suggestion, the friend says, "You should try it anyway." which is the repetition 
ot the request for suggestion. In our modified DCT, or better call it realistic 
conversation wrumg, only one informant has the boss say the Chinese equilavent of the 
hnglish. Well, it was only a suggestion,' which shows that Chinese people in Taiwan 
do not use the topic-ending utterance so soon. 

Offer: 

To high-status refuser: Oh, I'd feel better if I paid for i» 

To low-status refuse -: Well, maybe you should give it some more thought before 

turning it down. 
To status-equal refuser: Come on, just a little piece. 

In response to a high-status refuser against offer, the cleaning lady says. "No I'd feel 
better it 1 paid for it," which is the repetition of the request for offer and also a strateev 
of counter-rationale, trying to persuade the refuser. In response to a low-status refuser 
against offer, the boss says, "Well, maybe you should give it some more thought before 
turning It down, which is a strategy of postponement, asking for postponement in 
[ncV. 1 H '■".!'""^'f status-equal refuser against an offer, the friend says, "Come on 
just a little piece, which is the repetion of the offer and modification of the request. 

For the above twelve post-refusal speech acts, we should say, they are Takahashi 
tnir^"'^f' '''''''' ^'^^y --eflect a few of many possibilities Vf posfrefusal 
1 Threat strategies the two researchers used are nine for the twelve situations: 

2. Offering self an alternative 

3. Expressing disappointment and/or regret 

4. Postponement 

5. Topic-closing expression 

6. Repetition of the request 
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7. Counter-rationale 

8. Asking for postponement rn response 

9. Modification or the request 

Compared with the number of strategies which we summarized in the study, their 
strategies are few. It is easy to understand because theirs stand for the wisdom ol 
(maybe) only two researchers. In addition, though they may think of many strategies 
for post-refusal speech act, il is their task which forces them to simplify the post-refusal 
speech and only ONE hint can be given for the subjects to fill in refusal expressions. 
Our study focus on the subject's strategy of post-refusal speech act, which are based on 
the data from 48 informants, who were given the tasks in November, 1993. 



Table 1 : Stiategies used by the refused 
in response lo the lel'usei of ditYerent social statuses against request 
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9 


2.Changin;:; *he topic 


0 


0 


2 


J . V.'UUilld iallL/lUlis^ 


16 


24 


20 


4. Complying with the 

1 ^1 Li O^l 


7 


1 


10 


S PxttTiifil ves internal no 


1 
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2 


0 
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13. Explanation af the 
justification of the request 


5 


3 


1 


14. Postponement 


1 


4 


0 


15. Giving a lesson 


0 


0 


0 


16. Repeating the request 


~f 


1 


0 


17 Comforting the refuser 
and/or self 


0 


0 


0 


18. Showing surprise 


0 


1 


1 


19. Asking for explanation 


4 


1 


1 


20. Showing disappointment 
and/or regret, or complaining 


1 


1 


0 


21 . A Composite strategy 


2 


5 


1 


22. Topic-closing expression 


1 


2 


0 


23. Joke 


0 


0 


0 1 



The following are the summaries we get from valid realistic conversations made 
up by 48 adult native speakers of Mandarin in Taiwan. ach of them were asked lo 
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make up 12 conversations based on their experience, observations, and intuition. From 
Tables 1 through 4, if a silence strategy is used as a post-refusal speech act, it means 
that the negotiation of the episode of request-refusal only consists of a 'dialogue 
turn'—the speaker speaks (to request, suggest, invite, and offer), and then the refuser 
refuses, then the conversation is over, which happens rarely-the data also support the 
observation. We find that most realistic conversations consists of two 'dialogue turns' 
or more than two. If it is more than two 'dialogue turns', only the requester's first 
post-refusal speech act, that is only the first speaker's speech of the second turn is 
discussed and calculated in Table 1 through Table 4. 

For the first kind of request-the requester is of a low status, the post-refusal 
speech act strategies of the requester are of 14 among the 23 (refer to Table 1). For the 
request made to a status-equal classmate, the post-refusal speech act strategies of the 
requester are 12 of the 23 (Table 1). As for the request made to a low-status worker, 
the post-refusal speech act strategies of the boss include 9 of the 23 strategies. 

In response to a high-status refuser for an invitation to an expensive restaurant and 
sign the purchase contract, the low-status refusee used 9 strategies out of the 23 (refer 
to Table 2). In response to a status-equal refuser for an inviation to his house party, 10 
strategies out of the 23 are used (also refer to Table 2). In response to a low-status 
refuser for an invitation to the boss's house party, 7 strategies out of 23 are used (also 
refer to Table 2). 

In response to a high-status refuser (the teacher) for a suggestion of more 
conversations and less grammar in a foreign language course, most teachers use the 
siraJegy of giving a lesson to emphasize the importance of grammar to refuse to accept 
the student's suggestion. The students' post-refusal speech act strategies are 9 
strategies out of the 23 (see Table 3). In response to an equal-status refuser for trying 
the new diet suggested, the refusee used 12 strategies out of the 23 mentioned in this 
study. In response to a low-status refuser against a suggestion of writing reminders on 
a slip, the bosses used 12 strategies out of the 23 (see Table 3). 

In response to a high-status refuser against the cleaning lady's offer of 
compensating him for the broken vase, the cleaning lady reacts in 10 strategies of the 
23. In response to a status-equal refuser against the offer of one more cake, the refusee 
reacts in 8 strategies out of the 23. In response to the low-status refuser, the boss reacts 
in 10 strategies of the 23 mentioned in the study (refer to Table 4). 



Table 2: Strategies used by the refused 
in response to the refuser of different social statuses against invitation 



Strategies 


To High-status 


To Status-equal 


To Low-status 




refuser 


refliser 


refliser 


1. Silence 


0 


2 


0 


2. Changing the topic 


0 


1 


0 


3. Counter-rationale 


9 


14 


15 


4. Complying with the refuser 


7 


8 


18 


5. External yes, internal no 


0 


0 


0 


6. Saying THANK YOU 


0 


0 


0 


7. Criticism 


0 


c 


0 


8. Threat 


0 


0 


2 


9. Reprimand 


0 


0 


0 


10. Offering self an alternative 


0 


0 


0 


1 1 . Modifying one's request 


2 


1 


1 


12. Asking the refuser to find 


0 


0 


0 


an alternative 
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Table 2: Strategies used by the refused 
in response to the refuser of difterent social statuses against invitation (Continued) 



13. Explanation of the 
justification of the request 


2 


0 


0 


14. Postponement 


14 


5 


4 


15. Givini» a lesson 


0 


0 


0 


16. Repeating the request 


0 


0 


0 


17. Comforting the refuser 
and/or self 


0 


0 


0 


18. Showing surprise 


3 


1 


0 


19. Asking for explanation 


5 


9 


4 


20, Showing disappointment 
and/br regret, or complaining 


0 


3 


0 


21. A Composite strategy 


4 


2 


4 


22. Topic-closing expression 


1 


0 


0 


23. Joke 


0 


0 


0 


Table 3: Strategies used by the refused 
in response to the refuser of different social statuses against sug 


gestion 


Strategies 


To High-status 
refuser 


To Status-equal 
refuser 


To Low-status 
refuser 


1. Silence 


9 


5 


6 


2. Changing the topic 


0 


0 


0 


3. Counter-rationale 


23 


15 


10 


4. Complying with the refuser 


4 


4 


1 


5. External yes, internal no 


0 


0 


0 


6. Saying THANK YOU 


3 


0 


0 


7. Criticism 


0 


3 


0 


8. Threat 


0 


0 


0 


9. Reprimand 


0 


0 


2 


10. Oftering self an alternative 


0 


0 


0 


1 1 . Modifying one's request 


0 


0 


0 


12. Asking the refuser to find 
an alternative 


0 


0 


0 


13, Explanation of the 
justification of the request 


0 


4 


2 


14. Postponement 


0 


0 


0 


15. Giving a lesson 


0 


2 


2 


16. Repeating the request 


3 


5 


3 


17. Comforting the refuser 
and/or self 


0 


0 


1 


18, Showing surprise 


0 


1 


0 


19. Asking for explanation 


1 


2 


8 


20. Showing disappointment 
and/or regret, or complaining 


0 


2 


0 


2 1 . A Composite strategy 


4 


^ 
J 


7 


22, Topic-closing expression 


1 


2 


5 


23. Joke 


0 


0 


0 
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Table 4: Strategies used by the refused 



in response to the re 


user of different social statuses against offer 


Strategies 


To High-status 
refuser 


To Status-equal 
refuser 


To Low-status 
refuser 


1 . Silence 




5 


4 


2. Changing the topic 


0 


0 


0 


3. Counter-rationale 


10 


20 


18 


4. Complying with the refuser 


1 


8 


3 


5. External yes, internal no 


0 


0 


0 


6. Saying THANK YOU 


7 


0 


0 


7. Criticism 


0 


0 


0 


8. Threat 


0 


0 


1 


9. Reprimand 


0 


0 


0 


10. OtTering self an alternative 


0 


0 


1 


1 1 . Modifying one's request 


1 


2 


0 


12. Asking the refuser to find 
an alternative 


0 


0 


0 


13. Explanation of the 
justification of the request 


3 


0 


0 


14. Postponement 


0 


0 


7 


15. Giving a lesson 


1 


0 


0 


16. Repeating the request 


10 


8 


1 


17. Comforting the refuser 
and/or self 


0 


0 


0 


18. Showing surprise 


0 


2 


0 


19. Asking for explanation 


0 


0 


3 


20. Showing disappointment 
and/oi 1 egret, or complainmg 


5 


0 


0 


2 1 . A Composite strategy 


8 


2 


.6 


22. Topic-closing expression 


0 


0 


1 


23 Joke 


0 


1 


0 



4. Maxims of post-refusal speech act 

The twenty-three post-refusal speech strategies can be summarized into 4 
maxims-maxims of consistency, offense, agreement, and economy. The maxims of 
economy is not in complementary distribution with the other three; instead, it may 
overlap with other maxims. The maxim of address, which I contend in Liao (1994) to 
be a maxim of politeness in refusal, is also appropriate here. It is the Chinese custom 
to use the address form, to replace the second person pronouns, repeatedly when 
speaking to a high-status person. 

The strategies of 'complying with the refuser', and 'saying thank you' can be 
categorized into the maxim oF agreement. 

The strategy of 'criticism', 'threat', 'reprimand', 'giving a lesson', 'showing 
disappointment', and 'showing surprise' can be categorized into the maxim of offense. 
When a person used the offense maxim in post-refusal speech, it seems that s/he has the 
equal-treatment idea: since the refuser does not care about doing FTAs, then I will treat 
you as I am treated. When their face is threatened by being refused, they defend their 
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face by attacking the other's face, too. 

The strategy of 'counter-rationale', 'modifying one's request', 'asking the refuser 
to find an alternative', 'explanation of the justification of the request', 'postponement', 
'repeating the request', 'asking for explanation", 'showing regret', and a composite 
strategy can be classified into the maxim of consistency. 

The maxim of economy may include the strategies of 'silence', 'changing the 
topic', 'external yes, internal no', 'saying thank you', 'offering self an alternative', 
'topic-closing statement', and 'comforting the refuser and/or self. 

'Silence' is the mc t economic way in the maxim of economy. However, it is not 
widely used because 'a refusal without some kind of elaboration or follow-up can have 
severe social consequences (Gass and Houck, 1993: 1).' Chinese people use the 
strategy of 'silence' mostly together with being or pretending to be absent-minded or 
beginning to be busy with other tasks. 

5. Sub-study II 

As mentioned in Part 3, The Study, 596 subjects-298 males and the same number 
of females-were asked in December 1993 to judge their own features in 11 statements 
(labeled XI 1 to X21). They were asked to circle in a seven-point scale from 'strongly 
disagree' (labeled '1') to 'strongly agree' (labeled '7') for the eleven statements. After 
doing factor analysis on SAS, we find that three factors can be concluded to include the 
11 statements. Factor I is the factor mainly comprising X14, X16, X15, and X17; 
therefore it is labeled Group-relationship factor. Factor II is the factor mainly 
comprising X20, X19, X18, and X21, labeled Outstanding factor. And Factor III is the 
factor mainly comprising X12, X13, and Xll, labeled Loyalty factor. 

Factor I (Group-relationship factor): 

X14: I am open about my disagreement with my group. (Mean: 5.42) 
XI 6; My relationship with others is more important than my own accomplishments. 
(Mean: 5.31) 

X15: I respect people who are modest about themselves. (Mean: 5.39) 

X17: My happiness depends on the happiness of those around me. (Mean: 5.03) 

Factor II (Outstanding factor): 

X20: Speaking up in a group is not a problem for me. (Mean: 4.2) 

X19: I enjoy being unique and different from others. (Mean: 4.38) 

X18: I am comfortable with being singled out for praise or rewards. (Mean: 4.96) 

X21 : I find it hard to do anything that my parents would disapprove of (Mean: 4.86) 

Factor III (Loyalty factor): 

X12: I won't support my group if they are wrong. (Mean: 5.07) 

X13: I will stay with a group that needs me even if I am not happy with it. (Mean: 
4.80) 

Xll : I would be likely to sacrifice my self interest for the good of the group. (Mean: 
4.98) 

After concluding the eleven statements into three factors, the three factor scores 
for each subject were then obtained, which are standardized (that is, the mean is 0 and 
the standard deviation is 1). Then I did Student t-test to compare the difference 
between two sexes. Table 5 shows that males and females are not different in the 
Group-relationship factor and the Loyalty factor, while in the Outstanding factor, they 
are significantly different from each other. In other words, the low mean for females in 
the Outstanding factor indicates females are significantly reluctant to be unique, 
outspoken, and disapproving. It reflects the invisibility of women in language and 
behavior. Because ot the female and male difference in their characteristics, males and 
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females naturally have different strategies in post-refusal speech act. 



Table 5 Males and females' standardized means 
(and standard deviation) in three factors 





Group-relationship factor 


Outstanding factor 


Loyalty factor 


Males 


-0,0628 (1.000) 


0.1096 (0.992) 


0.0135 (1.073) 


Females 


0.0628 (0.998) 


-0.1096 (0.997) 


-0.0135 (0.923 


t-test value 


1.54 


2.69 


0.34 


(p-value) 


(0.125) 


(0.007) 


(0.74) 



According to Liu (1991: 229-243), prior to the 1960s women tended to conform 
to the group more tiian men did. However, the finding in this section seem to say that 
the women liberation movement since 1960s has not changed the phenomenon much. 
We agree to Liu's contention: when the group is important to a person, when a person 
has no sense of security, and/or when a person has no self-confidence, s/he tends to 
conform to the group s/iie is in so that s/he can be accepted. On the other hand, when 
a person thinks that his position in the group is unquestionable, and/or when he is 
generally accepted in the group, the chance for him to express different opinions will be 
great. 

The fact that men and women use different speech strategies may originate from 
child-rearing practices. Boys and girls are treated and expected differently since born. 
Chaiaka (1989: 3) also indicates that by studying sociolinguistics one can find how 
social situations determine what kinds of speech are used and how speech develops to 
meet social needs. 

These 596 males and females are also asked to write what they would say after 
they are refused for the following six situations. Each of the 298 men and the 298 
women was asked to fill in the post-refusal speech for only one FTA request. 

1. You don't have a car so you ask a friend if you can borrow his car so that you can 
take your visiting parents out. The friend refuses saying he doesn't lend his car to 
anyone. What, if anything, would you say after your request had been refused? 
Write down exactly what you would say. Use the exact words you would speak. 

2. You need NTS 15,000. You ask your older sister if she would be willing to lend you 
the money which you promise to repay in 3 months. She answers that she only has 
NT$30.000 in her bank account which she needs for next semester's tuition. What, 
if anything, would you say after your request had been refused? Write down exactly 
what you would say. Use the exact words you would speak. 

3. You are the president of the Alumni Association in charge of a reception for college 
alumni. At the last minute, key people on your committee back out. You ask a 
friend who owes you several favors to help. S/he refuses. What, if anything, would 
you say after your request had been refused? Write down exactly what you would 
say. Use the exact words you would speak. 

4. You have just asked a very good friend for help in moving. It will take all day 
Saturday. Your friend refuses saying that s/he has an important exam coming up. 
What, if anything, would you say after your reuuest had been refused? Write down 
exactly •.vhat you would say. Use the exact words you would speak. 

5. You are buying your course Looks with a friend. You discover that you are short by 
NT$500 and you ask to borrow this amount. You promise to repay this money as 
soon as you are able to. Your friend says that s/he n^eds the money and expresses 
reluctance to lend it. What, if anything, would you say after your request had been 
refused? Write down exactly what you would say. Use the exact words you would 
speak. 

6. Unavoidable confiicts have caused you to miss several classes. You don't really 
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know anyone in your class so you ask a classmate if s/he would be willing to let you 
photocopy her classnotes after class. There is a copying machine nearby, b/he 
refuses to lend you the classnotes. What, if anything, would you say after your request 
had been refused? Write down exactly what you would say. Use the exact words 
you would speak. 

Following the open-question, they were asked to circle on a seven-point scale 
from 'strongly disagree' as 1, to 'strongly agree' as 7 for the following statements: 

1 . 1 would just drop the subject. 

2. I would give reaso-.s to try to persuade my classmate to comply with- my request. 

3. 1 think that my request imposes on the other person. 

4. 1 feel it is very important to persist in asking this request. 

5.1 Male and female difference in post-refusal speech strategies 

Factor analysis and the Student t-test have told us the difference between men and 
women It is therefore, hypothesized that women use the maxim of offense less often 
than men do; instead women use more often the maxim of agreement to show they 
don't like to stand out in speech. Men have been conditioned from early childhood to 
be adventurous in both behavior and language. Women have been conditioned to 
approve and support men and those in authority. • c cc 

Table 6 shows the numbers of men and women in applying the maxims oi ottense 
and not applying it, in their post-refusal speech act after they are refused in the above 6 
open FTA requests. Table 7 shows that 28 (9.4%) of the 298 males applies the maxim 
of offense in post-refusal speech and only 14 (4.7%) of the 298 females uses the 
maxim. The Chi-square value is 5.02 (p-value=0.013 for one-tailed test), which 
means that there are significantly more males who use the maxim of offense in post- 
refusal speeches. 



iauic u. i-<uiiiuvt yjL i.ii^jv^ >...xy ^ffj 


Males 


Females 


1 Refused by Rood friends (of car borrowing) 


7 


1 


2 Refused by older sister (large amount of money) 


1 


1 


3. Refused by a good friend (received a lot of help from 
you) 


7 


9 


4. Refused by a good friend (having good reasons tor 
preparing for a test) 


1 


I) 


5 Refused of small amout of borrowing 


4 


1 


6 Refused by an acquaintance 


8 


2 


Total 


28 


14 



Table 7- Applying the maxim oi oriei 


ISe or nui. maica va !<- 

Males 


Females 


Applying the maxim of self-detense 


28 (9.4%) 


14 (4.7%) . 


Not aoolvinu 


270 (90.6%) 


284 (95.3%) 


Chi-square value=5.02, p-value-0.013 (one-tailed test) 1 



In response to refusers for the above six FTA requests, males and females' 
circling of the seven-point scales are counted and Chi-square test for the homogeneirty 
of males and females is done. It is found that males and females are significantly 
different in their judgments about their own personality and characteristics for three of 
the above four statements. They are not different for the statement of 'I think my 
request imposes on the other person.' For easy interpretation of the data, we rename 
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the choice of 1, 2, or 3 as 'disagree', 4 as 'neutral', and 5, 6, or 7 as 'agree'. In other 
words, we changed the 7-point scale into a three-point scale for the sake of analysis. It 
is then found that significantly more males said that they would like to drop the subject 
(or change the topic) than females did (48.99% to 39.6%; df=2; with Chi-square value 
being 7.309 and p-value 0.026); significant more males said they would try their best to 
persuade the refuser than females did (36.58% to 24.83%; df=2; Chi-sauare value 
being 13.579 and p-value being 0.001); more males said that they shoula persist in 
asking their request than females (16.11% to 6.04%; df=2; Chi-square value being 
16.669 and p-value < 0.001). The result in this paragraph is also consistant with the 
finding of factor analysis: women are reluctant to be unique, to speak up, and to do 
something which others do not approve of; therefore, women are reluctant to abruptly 
drop the subject immediately after they are refused, reluctant to give reasons to try to 
persuade the interactant and to persist in asking the request. 

6. Conclusion 

When a person reacts to a refusal by the strategy of criticism, reprimand, threat, 
giving the refuser a lesson, he protects his own face, without saving the addressee's 
face. It is a maxim of offense, to return an FTA with an FTA. The maxim is used 
more by the high-status person than by the low-status, more by males than by females. 

Though twenty-three post-refusal strategies were listed in this study, in Sub-study 
I, only 18 out of the 23 are used in response to a refuser for an FTA request. The 
strategies of 'criticism', 'asking the refuser to find an alternative', 'giving a lesson', 
'comforting the refuser or self, and 'Joke' are not applied. Thirteen of the 23 
strategies are used in response to a refuser for an FSA request, or an invitation. The 
five strategies not used in response to request refused are not used; neither are the 
strategies of 'saying thank you', 'external yes, internal no', 'reprimand', 'offering self 
an alternative', and 'repeating the request'. Fifteen out of the twenty-three strategies 
are used in response to a suggestion refuser. The strategies not used are 'changing the 
topic', 'external yes, internal no', 'threat', 'offering self an alternative', 'modifying 
one's request', 'postponement', and 'joke'. Seventeen out of the 23 strategies are used 
in response to an offer refucer. The six unused strategies are 'changing the topic', 
'external yes, internal no', 'criticism', 'reprimand', 'asking the refuser to find an 
alternative', and 'comforting the refuser and/or self. 

After being refused, women are more likely not to persist on their request. It is 
wondered, whether women are less likely to refuse than males because they are afraid 
to be offensive. Further studies are invited. 

Notes 

'In this short article with a limit of maximal 20 pages, I cannot give enough examples 
for each strategy of post-refusal speech act, nor can I give the details of how I manage 
the procedure of factor analysis on the SAS (Statistic Analytical System) package. I 
have the intention of expanding it to give a clearer view of Mandarin post-refusal 
speech acts. If readers are interested in the whole article, it will be sent upon request 
on completion. 

^Though suggestion is supposed to be an FSA, it seems that the acceptance of it does 
not follow ihe rules in which one is supposed to accept other kinds of FSA invitation 
or offer. 

^It seems t!iat the post-refusal speech acts must be divided into two kinds, those of the 
refuser and tl.ose of the refusee, then they can be discussed in a clearer detail. 
However, in this study, we do not try this. We will try it in later studies. 

■^We agree to Levinson's observation (1983: 351) that a pre-request allows the 
requester-to-be to check out whether a request is likely to succeed, and if not to avoid 
one in order to avoid a subsequent dispreferred responses, a rejection. 

^The ambiguity of THANK YOU is discussed a great deal in Liao (1994). 
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'Ni-^ ' . ^tb$^t '--i- '^o '-P=r ' : 

81 tj- 

i«^T mi 

'* "t9#lt#^#-8}^'-:^6^-«t<g^PiL4'] : 'S.P.^f^^i^^^UM.n polar 
gradable terms ^ ^ ^ ^ -fi : 

[5^7 ] [§T ] 

[j^T ] [^1 ] 

+ : (>L^f^ 1993) 

# ^ Si? >^ tam-ir • [^^l^^T^yJk 1 ] 

Chhan^ a . [©^ilf^^ItT ] 

(5) 5|v^-t4: ^;%ll>ib, K*'jT^#->^6«J*ll , t ^-^ ^ ^ ^-^ ^ ^ 

(tL^^^iiL6«J##l«^ : -ka- ^ H 6<j -#-) 

10. '-^ ', 'T '6^)??^a3t . 

'f^-il^f'JJi^ t (1992: 111--121, 129-138) ' , ' T ' 6^ 5g * ^ T 
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^^^^^ ^^ik^ m^^^ir 

m^^B;. 




11. 



life? iA^SASfSi^fli^f ?^*^^^^^^feS^6?ll7f 5 

[6] 3tfi6^gl*§-6{;;^,fej^||-4^«?, S (1980) (1987). 
[8] i^a? (1958 :210-222). 

[9] |:^«^^^&|tf^H.^^_^5jS, (1992: 208, 366), ^t^^t^^^i^^^ (1992: 
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• 1^^* (1993: 1089-1090), $t (1988), © 
(1958: 212), (1983: 192-193). 

[11] Li (1988) f^jhit — JK- 

[12] 21 ^ 13 <K4^r^n^iTt^it^-^ • 

[13] y^^nmii.^il^M^i^ , Chao (197O: 458-480). itll^P 

i^^U^i^^ ^(1950)^0 Ko (1976). 

[14] ti-fn-^^ Wang (1969) jil "t" 5^ A^^^JIfti^BA^^i^^ 

(|^:t Wang and Lien 1993) ^it^t^^^i^-^i^'^^i-^^ 
^y^^^f^^i^i^^^f^ • (^* Mei 1980, Tottie 1991, Yue 
1993 , Lien 1994) . 

[15] 1^ (1992) i9^i^[^t^]^n^;A^^^^ilVf'Jd^S 

^Pil^^cyi-. tm^'i^i^'-ir'^'^^^m CHen 1992, 

1993 . 

'iJ-i^', '^illi^' [^iLi^ll] 

^ ^ >dB: • 

1993. &.1rfel^6<J '-^ '^'^ . Sl-tfef-^ .1.5.138-142. 
£.4>^. 19637 13] ^o;i|(&3ii^l i^^6<J#^^4/f . tSI^^^ 2.136-175. 

1993a. (±.«6). ^ (:lr 1 1^ ^ 3) 

>S« 4* # ji. it -ih 4" 

1993b. (i.«6). (i:t#^r^*i:*S5) 

-jI* 4* SS^ -2. it 'ih 4* '■i^ 

: ^994. (±«6). (Ir t S 12) 

^ t II 'fh t 

.f^^. 1990. '^BL^S 't6<jS«^ '. t aif ^ . 214.71-76. 

^/fc^J^t. 1950. ^^i^ykFfii. ^ : w».,1r^- ^ ^ . 

f'J^. t^^K. 1992. it,?,: 

f'jj)?,;ri. 1992. <4t#ldt6^^<fe]#4#-. ^ ' Jt*^.-; ' # 

*3tS. (.'f^) • 264-270. ilt : 
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9jn4-. 1989. . ibt : 
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^^m. 1980. 
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1984. (^t^T-^). ih:^ -tfr . 
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Irih: ^Xmttft<^^ 198iai3^p. 
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Identifying the Parameters for a Typology of Chinese Affixation* 



Yen-Hwei Lin 
Michigan State University 



Although affixational morphology is limited in the Chinese languages, ^ this study 
demonstrates that the morpho-phonological patterns of the diminutiv^ypocoristic and zi 
affixation are rich enough to merit a detailed investigation. After discussing and comparing 
different ways to cross-classify the various affixation patterns in more than twenty Chinese 
languages, I propose a set of parameters which are eitiier phonologically or morphologically 
defmed for an innovative typology of Chinese affixation. 

This paper is organized as follows. In §1, 1 briefly discuss the inadequacy of a more 
traditional classification for Chinese affixation. In order to include bianyun in the affixation 
system, a different classification is proposed in §2. §3 examines intra-dialectal variation that 
provides further support for the proposal. Finally, a conclusion is given in §4. 



1. Types of Affixation 



The traditional view of an affixational typology is determined by Affix Placement, i.e., 
the position of the affix with respect to the stem, based on which we can classify affixational 
processes into prefixation, suffixation, infixation, and circumfixation. In Chinese languages, 
suffixation is the most common type, but the other three are also attested, as shown in (1).^ 
Prefixation is not uncommon in Chinese, but infixation and especi; 'y circumfixation are rare.^ 

(1) Affix Placement : prefixation, suffixation, infixation, circumfixation 

a. Prefixation: Taiwanese a - ma. 'grandmother' 

b. Suffixation: Beijing ygn - er -> par 'board' 

c. Infixation: Pingding xM-er ->xlua 'flower' (Xu 1981) 

d. Circumfixation: Yanggu m -er -> tlur 'rabbit' (Dong 1985) 

Some Chinese languages also exhibit stem internal alternations in a word formation 
process called bianyun (rime change), where the sounds and/or tones in the rime of a stem 
syllable (see e.g., Li 1963, Hou 1985, He 1981, 1982) are modified. Some examples of 
segmental changes from Jiyuan zi bian yunmu (zi changed rimes) are given in (2). The 
examples in (2bc) show segmental changes in the vowel and the coda consonant; (2.d) is a case 



* This reserach is partly supported by the Michigan State Universitiy All University 
Reserach Initiation Grant, project #94-25. 

^ In this paper, I use the terms 'language' and 'dialect' interchageablly. 

2 Throughout the paper, place/city names are used to represent the dialects spoken in the 
area, but common terms like Taiwanese, Cantonese, etc. will be used instead of the place 
names. In all the examples, tones are marked only when relevant to the discussion. 

3 The only case of circumfixation I know of is Yanggu er -noun formation. Lin (1989) 
posits a cirumfix [1 ...r], and Yip (1992) treats -er as a combination of a floating feature 
[lateral] and a suffix [r]. Chen (1992), however, argues that the phonological fomi of the affix 
is simply [-r]. If one accepts Chen's arguments, then Yanggu er affix is a suffix. 
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of vowel-coda segmental merger. In the Yangcheng example in (3), a tonal change 
accompanies changes in both segmental quality and quantity."^ 

(2) Jiyuan zi bian yunmu (He 1981) 

stem zi noun stem zi noun 

a. pi pi:u 'nose' c. tgin t<;i:g 'gold' 

b. xua xuD 'flower' d. pan pa 'board' 

(3) Yangcheng zi bian yunmu (Hou 1985) 

thye (31) thyD: (313) 'rake' 

In a model of morphological theory like that in Anderson (1992), bianyun may be considered a 
typical case of non-processual morphology. If so, bianyun would constitute a separate 
category along with other word formation processes such as reduplication and compounding: 

(4) Word Formation in Chinese: 

a. Affixation (prefixation, suf fixation, infixation, circumfixation) 

b. Bianyun (vowel change, coda change, segmental merger, tonal change) 

c. Reduplication (XXX, XYY, XYXY, XXYY, etc.) (Chiang 1992) 

d. Compounding (NN, VV, NV etc.) (Chao 1968, Li & Thompson 1981) 

This treatment of bianyun , however, misses two important generalizations. First, 
Bianyun and the majority cases of affixation in Chinese derive the same types of words, i.e., 
diminutive/hypocoristic and zi words, indicating a close tie between these two processes (Lin 
1989, 1993). As far as I know, ft/a/jyM/j derives only diminutive/hypocoristic and z/ words, 
so I suggest that bianyun be subsumed under affixation (see §2 below). Second, the various 
alternations induced by bianyun are also typical in many cases of Chinese affixation. 
Consider, for example, Yiwu er suffixation in (5), where we can see vowel lengthening, 
change in coda consonant, and segmental merger (cf. (3) above). We will see more examples 
in §3. 

(5) er suffixation in Yiwu (R. Li 1963, Fang 1986) 

a. di + n -> di:n 'younger brother' 

b. dog + n -» do:n 'basket' 

c. tsau + n — » tso:n 'table' 

In the framework of non-linear morphology and phonology, reduplication may be treated 
as affixation with a copying device (Marantz 1982, McCarthy and Prince 1986 among others). 
McCarthy and Prince (1986) distinguishes two types of reduplication: affixation (partial 
reduplication) and compounding (total reduplication). In Chiang (1992), Chinese reduplication 
has also been analyzed as syllable affixation. As for bianyun, I have argued elsewhere (Lin 
1989, 1993) that bianyun is a special type of affixation with templatic constraints on the 
derived words. If this analysis of bianyun is accepted, then together with the reanalysis of 
reduplication, Chinese word formation processes would consist of only affixation and 
compounding. With the affixation category to encompass both reduplication and bianyun , an 



Tones are marked based on Chao's numeral five-point pitch scale, with 1 indicating the 
lowest and 5 the highest pitch. 
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adequate classification of affixation patterns would need parameters beyond Affix Placement 
This IS the topic we now turn to. 



2. A Typology of Affixation and Bianyun 

Lin ( 1 989) takes into consideration the phonological aspects of Chinese affixation and 
bianyun and provides the following classification: 

(6) A- Rggular affixation : Affixation of a fuU-segment affix with/without syllable contraction 

a. without syllable contraction: e.g. Wushan (Li 1963) 

tau+ z 'er' -4 tau z 'knife' 

b. with syllable contraction: e.g. Rongchang (Li 1963) 

pei+ sr -> par 'cup' 
B. Bianym : Affixation of a feature-sized affix wiUi monosyllabic output. 

e.g. Jiyuan zi bian yunmu (same data as in (3), analysis by Lin 1989, 1993) 
pi + [+bk, +rd] (zi ) ^ pi:u 'nose' 
xua + [+bk, +rd] xua 'flower' 

tgin + [+bk, +rd] ^ tgi:g 'gold' 

pan + [+bk, +rd] -» pa 'board* 

This classification is based on differences in the i.hcnoio ^cal shape of the affixes and 
whether or not die derived word is monosyllabic or f .isyliaDiv.. For our purpose, we may 
Identity two major classifying parameters to characte.-ic this proposal: 

(7) a. Affix Form: full-segment affix 

. _ ..^ ^ feature-sized affix (degenerate affix, Lin 1993)5 

^tem-Afri?< CQntragQon: yes (incorporation of the affix into the stem) 

no (affix as a separate syllable) 

Let us now examine an extensive list of examples classified according to (7). Examples 
of common regular affixation are shown in (8) through (14), where the affix stands alone as a 
separate syllable. WiUi the exception of cases like Taiwanese in (13) and Lichuan in (14) 
where gemination/resyllabification occur, no phonological alter iations are induced in these 
cases. 

A. Affix Form: fuU-segment; Stem-Affix Cnnfrarfinn - no 

(8) zi suffixation in Beijing 

a- tsuo + ts4 -4 tsuo ts4 'table' 
b. phan + ts4 -4 phgn ts4 'plate' 



5 A degenerate affix is in die form of less than a full segment. It may consist of only one 
or a few features, a prosodic weight unit: niora, or the combination of tiiese two. 
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(9) er suffixation in Taihu (Fang 1993) 

a. hua + al hua al 'flower' 

b. tag + al ^ tag al 'bench' 

(10) er suffixation in Hangzhou (R. Li 1963) 

a. tag (44) + 1.(213) ^ tag (44) i (213) 'bench' 

b. khue(44)+ i(213) k^ue (44) i (213) 'chopsticks' 

(1 1) er suffixation in Wushan (R. Li 1963) 

a. k9 + z ^ k9 z 'song' 

b. tau + z ^ tau z 'knife' 

(12) er suffixation in Xining (Zhang & Zhu 1987) 

a. xua + e ^ xua e 'flower' 

b. kD + e ^ kD e 'song' 

(13) zi suffixation in Taiwanese (Yip 1980; Zhang 1983; Chiang 1990) 



a. 


kam 


+ 


a -> 


kam ma 




'orange' 


b. 


ap 


+ 


a 


abba (~ 


a pa) 


'box' 


zi 


and er 


suffixation in Lichuan (Yan 1989) 






a. 


k^io 


+ 


e (zi) 


khio 


e 


'eggplant' 


b. 


hai 


+ 


e 


hai 


ia 


'shoes' 


c. 


men 


+ 


e 


men 


ne 


'mosquito' 


d. 


pa 


+ 


i ier) 


pa 


i 


'scar' 


e. 


pen 


+ 


i 


pen 


ni 


'notebook' 



On the other hand, examples (15) through (20) illustrate the group of dialects in which 
the affix is incorporated into the stem, often resulting in phonological alternations. We have 
seen vowel lengthening, nucleus-coda merger, and coda replacement by the suffix in Yiwu 
above (examples repeated as (15)). The replacement of the coda segment with the suffix is a 
common result of stem-affix contraction; two more cases, Beijing and Luoyang, are given in 
(16) and (17). Epenthesis may occur to bridge between a high nuclear vowel and the newly 
incorporated suffix, as in (16a) in Beijing and (18c) in Zhengzhou. The Rongchang examples 
in (19) present a case where the suffix substitutes the whole rime of the stem,^ while the 
examples in (20) show tonal substitution. As discussed in Yip (1992) and Lin (1993), the 
stem and affix elements can both be retained as long as the resultant syllable is well-formed. 
For example, in (19e), the stem rime is not replaced by the suffix; rather, they both coexist in 
the derived word. 



6 Another example of this sort is Anxiang er suffixation (Ying 1988, Yip 1992). 
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B. Affix Form : full-segment; Stem-Affix Contraction: yes 
(15) er suffixation in Yiwu (R. Li 1963, Fang 1986) (=(5)) 



a. 


di + n — > di:n 


'younger brother' 


b. 


dog + n — > do:n 


'basket' 


c. 


tsau + n — > tso:n 


'table' 


er 


suffixation in Beijing (C. Cheng 1973, R. Li 1963) 


a. 


1 T r — 7 IT 




b. 


pa + r — > par 


'rake' 


c. 


pan + r -> par 


'board' 


H 

vi* 


k'^uai + r — > k'^uar 


•lump' 


e. 


iag + r — > iar 


'sheep' 


f. 


kou + r -> ko"r 


'dog' 


er 


suffixation in Luoyang (He 1984) 




a. 


pi + ui — > piui 


'nose' 


b. 


man + ui — > maui 


'door' 


c. 


ma + ui — > meiu 


'mother' 


zi 


suffixation in Zhengzhou (R. Li 1963) 




a. 


ua + u — > uau 


'socks' 


b. 


guE + u -> guau 


'boots' 


c. 


pi + u — > piau 


'nose' 


er 


suffixation in Rongchang (R. Li 1963, Lin 1989) 


a. 


pei ar — > par 


cup 


b. 


kag + ar — > kar 


'cistern' 


c. 


tau + ar — > tar 


'knife' 


d. 


kuan + ar — > kuar 


'officer' 


e. 


u + ar — > UaT (* 


ar) 'fish 



(20) er suffixation in Tunxi (Qian 1991) 

a. mi (55) + n(24) -> 

b. khua(53) + n(24) -> 



min (24) 'riddle' 
k''uan(24) 'chopsticks' 



With the parameters in (7), Bianyun can then be defined as a type of affixation with a 
different parameter setting from the two regular types discussed-above. Examples are^pven in 
(21) through (27). A feature-sized affix consisting of a syllable weight unit, i.e. a mora, may 
contribute to vowel lengthening (Yangcheng in (21), Hedhun in (22)) and thus affect the 
syllable weight of the affixed ou^ut; on the other hand, an affix consisting of only a tonal 
feature (Cantonese in (23)) or segmental fcatuies (e^.. Jiyuan in (24)) result in feature changes 
in the stem. Ute-piosodic mora, whidiiisotevoid of -segmental feature contents, could be 
filled by the nuclear vowel of the stem resulting in vowel lengthening as in Yangcheng (21) and 
Heshun (22) or by the default vowel avS in Huojia (25). In addition to vowel lengthening and 
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epenthesis, the surface alternations also result from the creation of a new segment by imposing 
the affixal features on the syllabic coda of the stem, e.g. (21c) and (24cg), or by merging the 
nuclear vowel and the coda, e.g. (24d) and (25bc). (Sir.ce this paper concerns mainly on the 
general classification of affixation types, we are not ab to discuss the analysis of bianym and 
readers are referred to Lin (1993) for details.) One th ng worth noting is that in (21) through 
(24), the derived words may consist of either closed or open syllables, while those in (25) 
through (27) require an open syllable output Such requirement, according to Lin (1993), 
triggers segmental merger, e.g. Huojia in (25). Segmental losses in (26b) and (27b) may also 
be considered a response to the same restriction. 

C. Affix Form : feature-sized; Stem- Affix Contraction: yes 

(21) zi rime change in Yangcheng (data from Hou 1985, analysis from Lin 1993) 

a. t^i + ^ ... [+bk, +rd] -> thi:u 'ladder' 

b. pa + M- ... [+bk, +rd] -> p^: 'rake' 

c. cin + n ... [+bk, +rd] -> ci:g (ci:Oi}) 'heart' 

(22) zi rime change in Heshun (data from Tian 1986, analysis from Lin 1993) 



a. 


lu 


+ 




lu: 


'stove' 


b. 


tai 


+ 




-> ta:i 


•bag' 


c. 




+ 


^ 


li:9 


•collar' 



(23) Cantonese duninutive tonal change (Yip 1980) 

a. uu: (21) + [+H] -> uu: (35) 

b. tSoe:g(53) + [+H] -> Soetg (55) 

c. yuk(22) + [+H] yuk(35) 

(24) er and zi rime changes in Jiyuan (data from He 1981, analysis from Lin 1993) 



a. 


pi 


+ 


[+bk, +rd] (z/ ) 


— > pi:u 


'nose' 


b. 


xua 


+ 


[+bk, +rd] 


— > xuo 


'flower 


c. 


tgin 


+ 


[+bk, +rd] 


tgi:g 


•gold' 


d. 


pan 


+ 


[+bk, +rd] 


-> pa 


•board' 


e. 


pi 


+ 


[-bk,+ixi] (er) 


— > piu 


'nose' 


f. 


ma 


+ 


[-bk, +rd] 


— > mee 


'horse' 


g- 


gin 


+ 


[-bk, +rd] 


-> giu 


'heart' 


h. 


pan 


+ 


l-bk, +rd] 


-> po 


'board' 



(25) D rime change in Huojia (data from He 1982, analysis from Lin 1993) 



a. 


li + 


^('D') 


> lia -> 


ie 


'Li (surname)' 


b. 


sun + 




> suan — ) 


sue 


'Sun (surname)' 


c. 


tig + 


^ 


» tiag — ) 


ti5 


'Ding (surname)' 



(26) er suf fixation in Dinghai (data from Fang 1993, analysis by Lin) 



a. 


ba 


+ 


[+nas, -bk] — > 


be 


'card' 


b. 


kai 


+ 


[+nas, -bk] — > 


ki 


'dog' 


c. 


gau 


+ 


1+nas, -bk] — > 


98 


'goose' 
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(27) er suffixation in Ezhou (data from Wan 1990, analysis by Lin) 

a. tso + [-bk. +lo] tsa 'table' 

b. (?ian + [-bk, +lo] gie 'heart' 

c. kau + [-bk, +lo] kse 'cake' 

Although the classification based on (7) appears to have successfully merged affixation 
and bianyun , it has not addressed the question of variation in syllable weight As mentioned 
above, the contraction of the stem and the affix under bianyun produces two distinctive output 
types: a single syllable (either open or closed) and a strictly open syllable. Such a distinction 
which goes beyond the Stem-Affix Contraction parameter is also called for in cases such as 
Jiyuan bianyun in (24) where the derived words are grouped into two: a strictly open syllable 
is required when the nucleus of the stem is a low vowel (24dh), but a high vowel may be 
followed by a coda nasal or glide (24aceg).7 

In addition to Affix Placement (as in (1)), I propose three new parameters for a complete 
classification of Chinese affixation patterns: Affix Form, Stem-Affix Contraction, Syllable 
Weight The options within each parameter are shown in (28). (29) exemplifies how these 
three parameters cross-classify the diverse affixation patterns in various Chinese languages. 

(28) a. Affix Form: (i) FuU-segment affix 

(ii) Degenerate affix 

b. Stem-Affix Contraction: Yes/No 

c. Syllable Weight: (i) a heavy bimoraic syllable 

(ii) a light monomoraic open syllable 

(29) A. Stem-Af^xX Contraction : No 

Affix Form : full-segment affix 

Examples: Beiiing zi suffixation. Taiwanese zi suffixation, etc. ((8)-(14)) 



B . Stem-Affix Contraction : Yes 

Affix Form Syllable Weight 



Full-segment 

Full-segment 
Degenerate 

Degenerate 



heavy 

light 
heavy 

light 



Examples 

Yiwu er suffixation, Beijing er 
suffixation, etc. ((15)-(20)) 



Yangcheng zi suffixation/infixation (21) 
Heshun zi infixation (22) 

Huojia D infixation (25) 
Ding hai er suffixation (26) 
Erzhou er suffixation (27) 



Notice that not every combination of these parameters has attested examples. As shown 
in (29) there is a gap marked by a question mark. If the affix is separate from the stem, the 
affix is always in the form of a full-segment syllable; I have not found cases where a feature- 
sized affix could become a separate syllable in the output. There are two possible ways to 



^ I will come back to the transitional cases across parameters below. 
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solve the problem: one is to reduce the number of parameters so as to eliminate redundancy, tlie 
other is to invoke universal principles/constraints for an explanation of the gap in the paradigm. 
The reduction of the number of parameters would result in undergeneralization and fail to 
reveal the similarities and differences between regular affixation and bianyun . An explanation 
of the gap lies in the concept of maximization. Maximality Principle in (30) intends to capture 
the generalization that languages tend to retain as much information as possible. 

(30) Maximality Principle (Prince 1985, Ito 1989): 

Units are of maximal size, within the other constraints on their form. 

If the output shape of a word is restricted to be a light open syllable, a full-fledged affix does 
not stand a chance to surface. As examples in (15)-(20) show, e.g. Beijing er suffixation, 
incorporation of a full-fledged suffix into the stem always creates a heavy syllable since the 
suffix would occupy the coda position of the syllable. On the other hand, a feature-sized affix 
may surface by being linked to the nucleus of the stem to yield a new segment; e.g., as shown 
in (25) through (27), the coda as well as the suffix are merged with the nuclear vowel, deriving 
a segment that contain the features of both the suffix and the original rime. The requirement for 
a li^t open syllable output and the addition of a full-segment affix is contradictory, and to 
achieve such a combination would require extensive loss of materials, a situation deviating 
away from the Maximality Principle. If this explanation is on the right track, we may never 
find examples to fdl the gap in (29). 

In this section, I have demonstrated that three new parameters successfully cross-classify 
various types of regular affixation and bianyun in Chinese and a systematic gap of this 
typology may be accounted for by an appeal to the universal principle of maximality. 



3. Transitional cases 

Consider now (31) through (34). These arc dialects that cannot directly be classified into 
the system in (29); they usually exhibit a mixed system allowing different options of one 
parameter to coexist I consider them to be transitional cases which are in the process of 
moving from one type of affixation to another type. In Mancheng, syllable contraction usually 
applies, but, as we can see in (3 lab), if the stem ends in a velar nasal or the high back vowel 
[u], the suffix would stay as a separate syllable. Lanzhou freely allows the variation between 
the bisyllabic or monosyllabic ou^ut (32). The Yuanyang examples in (33cd) show that when 
the stem ends in a nasal, the derived zi word has to be an open light syllable.^ As mentioned in 
§2, the outputs of Jiyuan er and zi suffixation in (24) vary between heavy and light syllables; 
Huojia zi suffixation shows similar behavior, examples are given in (34). 

(31) er suffixation in Mancheng (Chen 1988) 



a. 




+ 


9r 


4 ag gar 


'vegetable' 


b. 


au 


+ 


9r 


4 au uar 


'peach' 


c. 


u 


+ 


9r 


4 u9r 


Tish' 


d. 


pin 


+ 


9r 


piar 


'heart' 


e. 


p^'an 


+ 


ar 


p^'er 


'plate' 



8 See Yip (1992) and Lin (1993) for analyses. 
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(32) er suffixation in Lanzhou (Gao 1985) 

a. ma + m — > ma ui ~ maui 'horse' 

b. p*ie + UI — > p*ie ui ~ p*ieui 'plate' 

(33) zi suffixation in Yuanyang (R. Li 1963) 



a. 


§ua 


+ 


u 


-4 guau 


•brush' 


b. 


pi 


+ 


u 


-4 pi^u 


'nose' 


c. 


p^'an 


+ 


u 


-> p^a 


'plate' 


d. 


lian 


+ 


u 


-> lia 


'curtain' 



(34) zi suffixation in Huojia (data from He 1982, analysis from Lin 1993) 



a. 


pi 


+ 


[+bk, +rd] 


> pi:u 


'nose' 


b. 


t^ii 


+ 


[+bk, +rd] 


» t^iiiu 


'young horse 


c. 


§a 


+ 


[+bk, +rd] 




'fool' 


d. 


t^hye 


+ 


[+bk, +rd] - 


» t^i^'yo 


'eggplant' 


e. 


p^yaw 


+ 


[+bk, +rd] 


» p^'yD 


'ticket' 


f. 


fag 


+ 


[+bk, +rd] 


» f5 


'house' 


g- 


t^iin 


+ 


[+bk, +rd] 


^ tQi:g 


•gold' 



These transitional cases can be characterized according to the proposed parameters as the type 
of affixation that allows the coexistence of both options within the Stem- Affix Contraction or 
the Syllable Weight parameters. The proposed classification is shown in (35). The existence of 
such transitional cases further supports the need to recognize these two paraineters. 

(35) a. Affix Form : Full-segment 

Stem- Affix Contraction : sometimes yes, sometimes no 

Examples: Mancheng er suffixation (31), Lanzhou er suffixation (32) 

b. Affix Form : Full-segment 
Stem- Affix Contraction : yes 

Syllable Weig ht: sometimes heavy, sometimes light 

Example: Yuanyang zi suffixation (33) 

c. Affix Form : Degenerate 
Stem- Affix Contraction : yes 

Syllable Weight : sometimes heavy, sometimes light 

Example: Jiyuan zi and er suffixation (24), Huojia zi suffixation (34) 

We have seen that the same set of parameters account for both the inter-dialectal and intra- 
dialectal variations in the patterns of affixation. These variations may shed light on how one 
type of affixation changes to another. I hypothesize that the different types of affixation in (29) 
represent different stages of a changing process in which a full-segment affix is reduced to a 
feature-sized affix, stem-affix separation is being replaced by stem-affu merger, and the 
nwrged forms are changing from closed heavy syllables to light open syllables; The transitional 
types in (35) then showcase the change in progress. In terms of the proposed parameters, the 
hypothesized changing process may be characterized as in (36). 
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(36) Affix Form: Full-segment affix degenerate affix 

Stem- Affix Contraction: No -> Yes 

Syllable Weight: Heavy Light 

A detailed investigation to support the hypothesis is beyond the scope of this paper and will be 
left for future research. The analysis provided here, nevertheless, points out the direction 
along which one could gain understanding of the formal mechanisms involved in the change of 
Chinese affixational pattems. 



4. Conclusion 

In this paper, I propose an expansion of the parameters for a typology of Chinese 
affixation beyond the traditional general classification in terms of Affix Placement. The 
process of identifying the appropriate parameters is based on the morphological and 
phonological pattems of regular affixation and bianym. The systematic gap in the paradigm is 
suggested to be accounted for by a universal principle. This study not only suggests a new 
perspective in investigating the similarities and differences of Chinese affixation pattems but 
also provides a clue for a further study of the mechanisms of variation and change in affixation. 
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DISCOURSE ORGANIZATION AND ANAPHORA IN SPOKEN AND WRITTEN 

CHINESE DISCOURSE* 



Ming-Ming Pu 
University of Oregon 



1. Introduction 

For the past two decades, anaphora has been the focus of considerable research 
on discourse production and comprehension because it is fundamental in imderstanding 
the relationships among cognitive processes, discourse structtire and information 
distribution. There are three influential models of discourse anaphora in the functional 
domain: the distance model (Giv6n, 1983, 1989), the structural model (van Dijk & 
Kintsch, 1983, Fox, 1987; Hinds, 1977, 1979; inter alia), and the attention model 
(Chafe, 1987; Tomlin, 1987). 

The distance model argues for a correlation between anaphora and 
referential distance in discourse, e.g., ntmiber of clauses between a given anaphor and 
its antecedent. The distance model could be a manifestation of a psychological factor, 
such as short-term memory decay. According to an "iconicity principle" xinderlying the 
model, the longer the distance, the harder it is for the hearer to identify the referent, and 
so a more expUcit referential form (e.g., a full noun phrase) is required. The shorter the 
distance, the easier it is for the hearer to identify the referent, and hence a less expUcit 
referential form (a lexical pronovm or a zero anaphor-ellipsis) is reqmred (Givon, 
1983:18). The model recognizes various psychological factors that underUe the 
distribution of anaphora. However, it overemphasizes the linear nature of discourse and 
thus fails to account for instances of long-distance pronominalization and short- 
distance nominalization. 

The structural model emphasizes the relationship between discourse 
structure and anaphora. The hierarchical structure of discourse allegedly controls the 
use of anaphora: NPs (full noun phrases) are often used at the beginning or peak of a 
structural unit (e.g., episode, paragraph, etc.), while pronominals (lexical and zero 
pronouns) are often used within such a structural unit. The model presupposes the 
importance of hierarchical organization of discourse. Unfortimately, the problem faces 
difficulties to the extent that structural xinits such as paragraph, episode, event, theme, 
etc., are not well defined theoretically. Many structural vmits are hard to identify m 
spoken and written texts, and are prone to misinterpretation. 

The attention model emphasizes the role of cognitive processes, such as 
attention and memory, in guiding anaphoric choice in discourse. Tomlin (1987) defines 
these psychological factors in terms of a discourse unit (i.e., episode). He argues that an 
episode represents sustained attentional effort and endures imtil attention is diverted 
(i.e., an episode boimdary is reached). He demonstrates that NPs are used at the 
boundary of episodes when attention shifts, while pronominals are used within episodes 
when attention sustains (see also Tomhn & Pu, 1991). The model shows greater 
sensitivity to subjects and text-specific variations than other approaches in relatively 



* The author is indebted to G. Prideaux and R. Tomlin for their comments and suggestions on the 
paper. The study was supported by SSHRC postdoctoral fellowship number 756-92-0112. 
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simple production tasks. However, the model is less effective in accounting for 
anaphoric patterning in more complex spoken and written production and 
comprehension because the model seems to ignore the critical role played by social, 
interactional, and affective factors. 

While sharing the view that there is an important connection between a 
particular linguistic unit (namely, the episode/paragraph), and a cognitive factor 
(namely, the Umited capacity of working memory), the present study departs from the 
prior research in two important ways. First, the study argues that cognitive 
constraints are not the sole factor in determining a speaker's anaphoric choice. Most 
specifically, discourse is not merely organized in terms of information flow and 
propositional content. There are oflen factors that relate discoijrse and anaphora, such 
as discourse structure, pragmatic information, and interpersonal factors. We cannot 
provide a complete accotint of the distribution of anaphora in discourse processing 
unless we take into consideration all of these factors. Second, the study compares 
spoken with written Chinese narratives, and demonstrates that the two modaUties 
exhibit an overall similar pattern of anaphora, although some differences exist because 
of the specific characteristics of the two types of discourse. 

In what follows. We will first explore three important aspects of discourse 
processing-cognitive constraints, discourse structure, and pragmatic considerations, 
and the relationship between the three factors and the use of anaphora in Section 2. We 
will then provide a general interactive principle determining the basic pattern of 
anaphora in Section 3. While Section 4 will present an experimental study to test the 
general principle of anaphoric patterning and discusses the results of the experiment. 
Section 5 will illustrate, with a text-data analysis, that the general principle is also 
operative in written narratives. Finally, Section 6 will discuss the general findings and 
the implications of the present study. 



2. Factors determining the basic pattern of discourse anaphora 

The present study proposes, with data drawn from both spoken narrative 
production tasks and written discourse, that anaphoric choices made by Chinese 
speakers are constrained by cognitive, discourse, and pragmatic factors. Cognitive 
constraints refer to the memorial and attentional processes that underlie anaphoric 
patterning during narrative production. Discourse constraints specify speakers' 
hierarchical organization of cHscourse into smaller units and the marking of these units. 
Rragmatic constraints include speakers' intention of signaling hearers of the status of a 
given referent, their effort to avoid referential ambiguity, and their empathy with 
hiunan central characters. Although these factors have been discussed in theories of 
anaphoric production in cognitive science, psycholinguistics (Chafe, 1987; van Dijk & 
Kintsch, 1983; Gejiisbacher, 1990; Tomlin, 1987), discourse processing (Fox, 1987; 
Givdn, 1983; Hinds, 1979; Marslen-Wilson, Levy, & Tyler, 1982; Tannen, 1982) and 
pragmatics (Brown, 1983; Giv6n, 1989; Grimes, 1978), they tend to be explicated in 
isolation of one another. We argue that these three factors represents three dimensions 
of the relationship between discourse and anaphora: the plane of cognition, the plane of 
discourse structure and the plane of pragmatics. They integrate and interact to 
determine a speaker's anaphoric choice throughout discourse. Cognitive constraints 
characterize structured representation of information in memory, which is manifest 
most conspicuously by the hierarchical units of discourse. Discourse structure of 
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various levels controls the basic pattern of anaphora -with regard to the location of each 
specific referent (e.g., structure-initial or structure-internal), and thematic coherence of 
each discourse unit. Pragmatic considerations specify speaker's empathy with human 
central referents and speaker-hearer interaction. Without structural factors, the use of 
anaphora would appear to be random, and without pragmatic consideration, anaphoric 
patterning would not be complete. 

2. 1. Cognitive constraints, discourse organization, and anaphora 

The cognitive basis of episodic organization of discourse has been extensively 
investigated in linguistics, psychology, and cognitive science. Studies have shown that 
speakers, who are constrained by limited memory capacity, try to organize the overall 
discourse into sequences of episodes. Each episode consists of a sequence of sentences 
dominated by a piacroproposition (van Dijk & Kintsch, 1983). The macroproposition 
relates sentence propositions at a higher level and thus derives the global meaning of an 
episode or a whole discourse from the local sentential meaning of the discourse. 

The notion of episode as a semantic tmit dominated by a macroproposition 
has been found to have psychological relevance. Black and Bower (1979), for example, 
demonstrated in a psychological study of story processing the existence of episodes as 
chimks in narrative memory. Similarly, Guindon and Kintsch (1982), in their 
experiment studying the macrostructure of texts, found that macrostructure formation 
appears to be a virtually automatic process. That is, people appear to form 
macrostructure dming reading and derive relevant macropropositions of a passage as 
soon as possible. Their findings provided evidence for the "episode" and the 
"macrostructure" theories of van Dijk and Kintsch (1978, 1983) . 

Other studies of story processing (Mandler & Johnson 1977, Haberlandt, 
Berian, & Sandson 1980, Gemsbacher, 1990) suggest that readers slow their 
processing at or aroimd the episode boimdary. The increased reading time at bovmdaries 
exceed that which would be predicted on the basis of sentence level and text level 
factors. The boimdary hypothesis, which derives fi:om these findings, assumes that 
there are cognitive processes at or aroimd the episode boundary which are not present 
inside the episode. At the beginning, readers shift fi:om actively building one 
substructure to start another, and laying the foundation for the new episode consumes 
more mental efifort. Haberlandt et al. (1980), who tested the boundary hypothesis with 
reading and recall experiments, found that the encoding load was greater at the 
boundary nodes than elsewhere, suggesting that readers are sensitive to episode 
boimdaries and use them in encoding story information. 

Gemsbacher (1990) supports the episodic organization in story 
comprehension on the basis of various experimental results. She reports that 
comprehenders capture the episode structure of narratives in their mental 
representation by building separate substructures to represent each episode. The 
readers shift to build new substructures for new episodes, when and where information 
of the previous episode is less accessible to them. It is therefore harder for readers to 
draw coherence inferences across two episodes than within the same episode. 

The cognitive basis of discourse organization helps us further imderstand the 
relationship between discourse structure and anaphora. An episode, as a semantic imit 
subsumed under a macroproposition, is the textual manifestation of a memory chunk 
which represents sustained attentional effort and endures imtil an episode boimdary is 
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reached. Attention shifts when the processing of the episode is completed. In other 
words, "the macroproposition remains in Short Term Memory for the rest of the 
interpretation of the same episode. As soon as propositions are interpreted that no 
longer fit that macroproposition, a new macroproposition is set up" (van Dijk, 1982, p. 
191). At an episode boundary where a change of macroproposition occurs (i.e., new 
agents, places, times, objects or possible worlds are expected to be introduced), the 
encoding load is much heavier, the reference under concern is less accessible, and hence 
a more expUcit anaphoric form (e.g., an NP) is required to code the referent. Within an 
episode, when the macroproposition is maintained, the referent under consideration is 
more accessible and hence a less explicit anaphoric form (e.g., a pronominal) is 
sufficient to code the reference. 

Indeed many studies on anaphora have reported the alternation between NPs 
and pronominals to be a fimction of the paragraph or episodic structvire. Hinds (1977), 
for example, discusses how paragraph structure controls the choice of NPs and 
pronouns. He finds that noim phrases are used to convey "semantically prominent" 
information in peak sentences of a paragraph while pronouns are used to indicate 
"semantically subordinate" information in non-peak sentences. Fox (1987) 
demonstrates that structtiral factors of discourse establish the basic pattern of 
anaphora: NPs are generally used at the beginning of a "development structure" to 
demarcate new narrative units, whereas pronominals are used within that structure. 
Marslen- Wilson et al. (1982) also argue that a speaker's use of referential devices is 
governed by discotirse structure and the context of speaking. The general pattern of 
anaphora is that NPs and proper names are used to establish initial reference at an 
episode when a particular referent is in a state of low focus, whereas pronouns are used 
to maintain reference within an action sequence when a particular referent is in a state 
of high focus. 

2.2. Pragmatic, interpersonal factors and anaphora 

In addition to discourse structure, pragmatic and interpersonal factors also 
affect the speaker's anaphoric choices. Speakers, at any given moment, try to help 
hearers build a structure representation of discourse congruent with their own in order 
to convey the intended message succ . sfully. The speakers' assessment of the hearers' 
current knowledge afiects both what is said and the structtires chosen for saying it. 

In narrative production, speakers' referential choice is based partially on an 
assessment of their hearers' knowledge with respect to a particular referent, and they 
provide guidance for the hearers to identify uniquely each given referent through the use 
of anaphoric form. If speakers believe that a concept has already been "activated" or is 
resident in the hearers' consciousness (Chafe, 1987), they will treat that concept in an 
attenuated manner, most likely pronominalizing it. If speakers believe that the concept 
has not yet been activated, they will treat it in a less attenuated manner, most 
probably nominalizing it. If speakers believe that they need to disambiguate referents 
for their hearers, they will nominalize them to resolve the ambiguity. In general, 
speakers' anaphoric choice seems to foUow closely Grice's (1967) dictimi: do not be more 
informative than required. 
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Moreover, when two or more referents have been activated and compete for 
attention, hiiman referents are preferred to be pronominalized over non-himian or 
inanimate referents. Speakers tend to empathize with a human (Kuno & Kaburaki, 
1977, Brown, 1983) because himians are generally more topical, more central, and 
more frequently attended to in narratives. On the other hand, when two or more human 
referents are competing for focal attention, the protagonist of a narrative tends to stay 
in focus longer than non-central characters and is consequently more Ukely to get 
pronominalized (Currah, 1990). 



3. A general principle of anaphoric patterning 

Based on the interaction of all three factors explicated above, the present study 
proposed the following general hypothesis for the anaphoric patterning in narrative 
production. 

The basic pattern of anaphora throughout discourse is controlled by 
speakers/writers' organization of discourse into episodes, that is in turn 
constrained by cognitive processes of attention and memory. The 
pattern is completed by the consideration of pragmatic information 
available for each specific referent. 

The general hypothesis involves several claims. First, episodes represent separate 
memory tmits in discourse processing. Narrative discourse is not only memorized, 
stored and recalled as episodes, but also produced as episodes. Second, episodic 
structure partially controls anaphoric patterning. NPs are used at the beginning of an 
episode when attention shifts and the reference is less accessible; pronominals are used 
within an episode when attention sustains and the refe. snce is more accessible. Third, 
interpersonal and pragmatic considerations complete the pattern. While indefinite NPs 
are used for the first mentions of referents anjrwhere in discourse, definite NPs are 
used for reinstating reference at the boundary, resolving referential ambiguity, and 
coding nonhuman and noncentral reference within an episode or a subimit. 

In order to test the hypothesis and hence the above three predictions, 
anaphoric patterns in both spoken and written narratives are examined in both 
experimental condition and naturally occurring written texts. 

Since the constmct of episodes plays a crucial role in the present study, 
definitions are needed for the theoretical concepts of episode and episode boimdary. The 
definitions are defined according van Dijk (1982), van Dijk and Kintsch (1983), and 
Tomlin (1987),. / 

Episode . An episode is defined cognitively as a memory unit/chupk in the 
flow of information processing. Attention is sustained in an episode until an episode 
bovmdary is reached. Linguistically, an episode is a semantic unit subsumed under a 
macroproposition. The macroproposition is generally a topical expression, which is 
derived from a sequence of sententially expressed propositions of discourse. Episodes in 
a discourse may be of varying length or scope. 

Episode boundary . An episode is conceived of as a part of a whole discourse, 
having a beginning and an end. The beginning and end of an episode are defined in terms 
of propositions subsumed under the same macroproposition, while the propositions 
preceding the first and following the last proposition of an episode should be subsumed 
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under different macropropositions. The transition between macropropositions represent 
episode boundaries. They are normally marked by expressions denoting changes in 
time, place, scenery, participant, perspective, possible world, etc. Cognitively, 
boundaries may also be manifestations of attention shifts. 



4. The experiment 

The experiment was conducted to examine the relationship between cognitive 
processes, discourse structure and use of anaphora in speakers' narrative production. 
More specifically, it was designed to test (a) if the structural unit of episodes has 
psychological relevance, (b) if the episodic structure controls the basic pattern of 
anaphora, and (c) if pragmatic and interpersonal factors are employed to complete the 
anaphoric patterning. 

In the present experiment, episode boundaries were operationally defined 
and manipulated by imposing perceptual disruption (i.e., video-cuts) in the flow of visual 
materials. The manipulation of speakers' attentional effort would presumably affect 
their episodic organization and hence their use of anaphora throughout discourse 
production. 

4.1. Experimental Method 

Stimylvs materials- The stimulus material for the study consisted of 
adaptions of three excerpts from a children's picture storybook (without a written text) 
about a little boy, " Here come.!tAlex PumpemjckpF (Krahn, 1981). The picture book was 
chosen for several reasons. First, many of the cognitive processes and mechanisms 
involved in language processing are not specific to language (Gemsbacher, 1990). They 
are general cognitive processes and mechanisms. Comprehenders easily segment 
stories after viewing a non-verbal picture story, or watching a movie without a dialogue 
(Baggett, 1979). Second, the book consists of eight separate, but related episodes of a 
story. Each episode describes some activities during a day in Akx's life and each episode 
is subtitled. Third, with the subtitles removed from the stimulus material, the subjects' 
recognition of episodes in this experiment would be independent from linguistic 
infonnation. We would thus avoid risking the problem of circularity in defining and 
identifying episodes. The purpose of the experiment was to see if subjects would 
organize, store, produce or recall the non-verbal story in terms of episodes after viewing 
the picture sequence without any linguistic clues. 

The three episodes adapted for the present experimental study are subtitled: 
(a) Msx. Pumpernickel in a stkkv situation ( 12:00 p m. )^ {h)ALex Purkpemickel ftwnt.t fq. 
flyl(2;00p.n\.}, and (c) AZgx Pumpernickel lends a hand (10:00a.m.) . These three 
particular episodes were selected because of some pragmatic characteristics of 
anaphora to be investigated. Each episode consisted of (a) himian, nonhviman and 
inanimate referents, (b) himian central versus noncentral characters, (c) old versus 
new characters and human characters of the same versus different gender. These 
options would permit us to assess whether the pragmatic considerations of empathy, 
centrality, and ambiguity resolution play a role in subjects' anaphoric choice. 

Each of the three episodes consisted of eight pictures, presented in pairs on 
each page. The three episodes (i.e., twelve p airs of pictures, with subtitles removed) 
were made into a black and white video prrgram. The video could be viewed as a cartoon 
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sequence of 12 pairs of pictures from a Macintosh screen. The resulting video program 
was designed to provide as little background as possible. 

Experimental conditions. While watching the video, subjects had to 
press the computer mouse button to advance from one picture to the next. The 
transition between pairs took approximately 3 seconds. At the moment the mouse was 
pressed, its click and the noise coming from the computer as it changed pictxares were 
clearly audible. The brief interrupting period between the video-cuts, together with the 
accompanying noise, provided strong visual and auditory disruption to the subjects' 
attention. The disruption between each pair of pictxares was inserted to manipulate 
subjects' cognitive processes of attention and memory. In other words, it served as an 
imposed episode boundary, which would force subjecte to reorient their attention (and 
hence reorganize episodic structxares) so as to continue with their production task. 

Two experimental conditions, Even and Odd, were established to test the 
present hypothesis. In the Even condition, the pictxare sequence was presented in the 
original pairs (twelve pictxire frames); that is, the three original episode boxmdaries did 
not cut into any of the twelve imposed boxmdaries. In the Odd condition, the first single 
pictxare of the first episode was presented alone and the rest of pictxares were in pairs, 
with the last single pictxire of the last episode also presented alone. There were 
therefore thirteen pictxare frames in the Odd condition, with two of the three original 
episode boxmdaries being embedded in two of the pictxare frames. That is, the two 
original episode boxmdaries conflicted with two imposed boundaries. 

Subjects . Twenty volxmteers participated in the experiment. They were all 
adult native speakers of Mandarin Chinese at the University of Alberta. Half of the 
subjects were male and half female. All subjects completed the experiment in Mandarin 
Chinese. The subjects were assigned randomly to two conditions. Even and Odd. 

Procedxores . There were two narrative production tasks: an on-line 
description task and a recall task. In the on-line task, subjects were asked to watch the 
video program and at the same time produce a story based on the pictxires presented on 
the screen. They were told to take as much time as needed for each single or pair of 
pictures. Once finished with a screen, they coxild not see it again. 

Since discoxarse organization is assumed to be a manifestation of cognitive 
processes, it was expected that subjects would respond to the episode boundary in 
exactly the same way, regardless of how the pictxare sequence was presented. In other 
words, subjects were expected to recognize and mark the episode boxmdary with full 
NPs regardless of whether or not it was embedded within a pictxare frame. 

Upon completion of the on-line description, subjects were asked to recall the 
entire story they had just described. They were instructed to retell as much as possible 
of the story, without seeing the pictxare sequence. Since no video-cuts were present in 
the recall task, subjects were expected to retrieve the story as consisting of three 
original episodes, regardless of their experimental conditions. Because the episodes are 
assumed to act as separate memory units/chunks, subjects shoxold be able to structxare 
and mark such xanits lingxaistically. 

In the recall task, each of the Mandarin groups (i.e.. Even and Odd) was 
divided into two sxibgroups: five of each group performed the recall task in oral form and 
the other five in written form. The task was so divided because Mandarin Chinese 
makes no gender distinction among third-person pronouns in oral form; all third-person 
singxolar pronoxms ("he/she/it ") have the same pronxmciation ta. Chinese subjects 
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therefore might have to use NPs to distinguish male characters from female characters 
in orally retelling the story. However, in written Chinese, a gender distinction is present 
for personal pronouns, and there are three different forms for "he," "she," and "it." By 
performing a written recall task, subjects would be able to use disambiguating pronoims 
instead of NPs. Thus, it could be possible to distinguish disambiguating anaphors from 
those sensitive to episode boundary conditions by comparison of oral and written 
productions. 

4.2. Results and discussion 

General Performance The subjects' general performance across 

conditions and tasks was very similar in terms of anaphoric production. Subjects in 
each group produced almost the same number of NPs (Even, 113; Odd, 117) and 
pronominals (Even, 85; Odd, 77). No statistically significant dififerences were found. 
Moreover, when written and oral narratives were compared, no difference was found in 
the use of lexical pronoims for the human central character: for the Even condition, 
27% in the written and 27% in the oral; for the Odd condition, 23% in the written and 
23% in the oral. As for the himaan non-central characters, lexical pr6nouns used in the 
written recall were less frequent than those used in the oral recall. Since no differences 
were foimd in subjects' anaphoric choice between written and oral recalls, the two sets 
of data were combined in the present study. 

Humanness and Centralitv of Referents As discussed in the previous 
section, speakers tend to empathize first with a hvmian in narratives and use this 
pragmatic information in encoding referents. This prediction was home out in our 
experiment. Figure 1 below shows the frequency distribution of pronominals over the 
three types of referents (i.e., human, nonhtiman and inanimate) for both conditions. 
While about 49% of human referents were coded by pronominals, nonhxraian and 
inanimate referents were coded by pronominals only about 25% and 11% of the time 
respectively. 
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Figure 1: Pronominal Distribution by humanness 
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Based on Figure 1, the following hierarchy can be proposed for anaphora and 
pragmatic factors: 

Pronominal: Human > Nonhuman > Inanimate 



(NP: 



Human < Nonhuman < Inanimate) 



The hierarchy, which conforms to Kuno & Kaburaki's (1977) empathy hierarchy, 
illustrates a general pattern of anaphoric choice over different types of referents. Since 
himaans are generally more topical, more central, and more frequently attended to in 
narratives, pronominals (less coding materials) are more frequently used to refer to 
them. On the other hand, the factor of "centrality", as predicted by the present study, 
plays a very important role in determine speakers' anaphoric choice dtiring narrative 
production. Figure 2 below shows the huge difference between the coding of human 
central and noncentral referents in the use of pronominals. 
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Figure 2: Pronominal Distribution by Centrality 

The percentage of human central versus noncentral referents encoded by 
pronominals is about 70% versus 28% on average across conditions. Although these 
differences are striking, they are not surprising because human central referents are 
usually the subject of the narrative, and they tend to be under focus and are discussed 
more frequently than are non-central referents. Moreover, when htiman referents is 
distinguished for centrality, the difference between the proportion of himian noncentral 
versus nonhtmian referents is only marginal. This was restjlted from the differential use 
of lexical versus zero pronouns. While lexical pronouns are rarely used to code referents 
other than himians in Chinese, zero anaphora are very often used to refers to both 
human and non-himian referents. 
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The experimental restdts support the claim that "humanness" and 
"centrality" affect a speaker's anaphoric choice in narrative production: the more 
central a referent is (i.e., usually human), the more it will be attended to, the longer it 
will remain in focus, and consequently attenuated anaphoric devices (i.e., pronominals) 
will be used to code and identify it. 

Episodes a s Memory Units The episode boundary results obtained from 
the recall task in all four groups provided evidence that episodes exist as chunks in 
narrative memory. Although liiere was no written clue in the video stimuli that there 
were three original episodes in the story, 17 out of 20 subjects (85%) recognized the 
three original episodes and mentioned the fact overtly. More interestingly, some 
subjects' recall data showed the specific monitoring role that macropropositions play in 
discourse processing. These subjects first recalled the paragraph level theme, or 
macroproposition, and then the whole episode came flowing out. Some exact wordings 
are "Well, it's about the boy chasing the fly, "Okay, it's about the kid swatting a fly, 
or "Yes, it's about the boy and the fly". 

In addition to the overt mention of the three episodes, subjects consistently 
marked the beginning of each episode by using NPs that reinstate the referent 
throughout their recall task. This demonstrates, as specified by the boundary 
hypothesis (Mandler & Johnson, 1977; Kintsch, 1977; Haberlandt et al., 1980), that 
cognitive processes at episode boundaries are different from those inside the episode. 
The subject had to devote a special effort to encoding the beginning of an episode 
because (a) the sxibject tried to grasp the initiating and topical event of the episode 
during the quick flow of discourse processing, (b) the subject identified the protagonist of 
the episode and established a new memory location for the protagonist, and (c) at the 
beginning, the subject shifted the perspective, breaking the sustained attentional effort 
for the previous episode even when the protagonist of the episode remained the same. 
In general, much as Gemsbacher (1990) observes, subjects shift to build a new 
substructure for a new episode, when and where more cognitive efforts are required for 
laying the foundation of the new episode. 

Episode B oundary Results As the present hjrpothesis predicted, NPs 
should be used at episode boimdaries to reinstate reference when attention shifts, while 
pronominals should be used within episodes to maintain reference when attention is 
sustained. This was exactly what happened in the experiment regardless of conditions. 
The episode boimdary results are shown in Tables 1 and 2, where data are calculated as 
Hits and Misses. Hits are NPs used at an episode boimdary plus pronominals used 
within an episode; Misses are NPs used within an episode plus pronominals used at a 
boundary. 





At an episode 


Within an 




Anaphora 


boundary 


episode 


Proportion of 


NP Pronominal 


NP Pronominal 


of Hits (%) 


Even 


75 0 


139 353 


75.49 


Odd 


76 3 


143 294 


72.71 


TOTAL 


151 3 


282 647 


74.10 (Average) 



Table 1: Episode Boundary Results in the Recall Task 
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Anaphora 



At an episode Within an 

boundary episode 

NP Pronominal NP Pronominal 



Proportion of 
of Hits (%) 



Even 
Odd 



210 30 48 371 88.16 
232 31 75 340 84.37 



TOTAL 



442 61 123 711 86.27 



(Average) 



Table 2. Episode Boundary Restilts in the On-line Task 



The hit rates of the two groups in each task are very similar (about 74% in the recall 
task and 86% in the on-line task). There is no statistically significant difference found 
within and across conditions. The results demonstrate that subjects managed reference 
in discoiu^e production following a general pattern. Their choice of anaphors reflected 
their discourse organization in the oral and written production task, which was partially 
controlled by their cognitive activities of memory and attention. 

4.3. Coimter-examples? 

On the other hand, the experimental results reveal that overall about 19% of 
tokens (26% in the recall task and 14% in the on-line task) seem to run counter to the 
boundary theory, i.e., NPs used within the episode and pronominals used at the 
boimdary. These counter-examples can also be^iccounted for by the present 
hypothesis, which is reported subsequently in this section. 

At the boundary: Inter-epiaode pronominals. Let us first examine the 
recall task (see Table 1). The inter-episode pronominals are very few (three cases, 
about 1% of all misses). In the on-line task (see Table 2), however, the inter-episode 
pronominals was about 33% (61 out of 184) of all misses . There are several possible 
explanations to account for the occurrence of these pronouns at the boundary. 

First, some subjects appeared to be more sensitive to original boimdaries than to 
imposed boxxndaries. They always marked an original boundary with NPs, but failed to 
mark the imposed ones firom time to time. They tended to keep the central character in 
focus and pronominalize them xmtil an original boundary was reached. This trend 
accounted for 52% (32 out of 61) of inter-episode pronominal misses in the on-line task. 

Second, about half of the subjects responded to both original and imposed 
boundaries for the first half of their descriptive task, marking both types of boundaries 
with NPs, but they seemed to overcome the imposed boundary gradually. By the time 
the last original episode was reached, the major character had been well established and 
many subjects overrode the imposed boimdaries, using pronominals to maintain 
reference through to the end. Such uses of pronomineds account for about 30% (18 out 
of 61) of inter-episode pronominal misses. On the other hand, it is interesting to see that 
though the central character was maintained with pronouns within the last original 
episode, non-central characters were almost always referred to by NPs regardless of 
the gender of the central and non-central characters. 
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Third, the other 18% (11 out of 61) of inter-episode pronouns all came from one 
subject who recalled the three episodes as if there were a single one. He did not mark 
any of the imposed and original boundaries except the first one, where they introduced 
the partiapants of the story with NPs. The reason these subjects failed to recognize 
episode boundaries is not clear at the moment. An explanation may be sought in the 
realm of individual differences in language/task abilities. 

Within the episode: Intra-episode NPa. Intra-episode NPs accounted for 
17% of all tokens (405 out of 2420) produced by the subjects in both tasks: 26% (282 
out of 1083) in the recall task and 9% (123 out of 1337) in the on-line task. Some non- 
ad-hoc explanations can account for these intra-episode misses, apart from cases of 
ambiguity resolution. 

First, as is evident from the discussion of "centrality," the protagonist of the 
story tends to be kept in focus and pronominalized within episodes. In contrast non- 
central characters are frequently nominalized, even when pronoun gender could 
distinguish between the central and non-central referents. This is illustrated in the 
example taken from a subject's written recall data. 

1) The child is walking on the street. Just as (he) goes around the comer, (he) sees a woman 
with two bags of grocenes in her hands. He steps forward and asks the woman if he could 
help. The woman is happy to give him one of the bags. He canies the bag and follows (her) 
Something in the bag is moving all the time. He is very curioas. As soon as the woman turns 
Uie corner and can t see him, he opens the bag. Out jumps a big lobster and scratches his 
tace. He Ues the bag up and quickly catch up with the woman. 

Here, lexical and zero pronouns were consistently preferred for the central 
character, with full NPs used for the non-central character even if gender could come 
into play. The differential use of anaphors between the central and non-central 
characters thus resulted in more NPs than expected (34% of all intra-episode NPs i e 
137 out of 405 NPs used within episodes). ' " ' 

Another phenomenon observed was the fact that more NPs were used within 
the first episode (in which ^ appears with another child) than within the other two 
There are two possible reasoiis for this trend: 1) at the beginning of the recall task 
subjects usually established and identified the participants with more NPs than ' 
expected, and 2) in the first episode, both participants appeared in each of the eight 
pictures and both took part in the activities together; subjects thus tended to weigh 
both characters equally for centrality. The following is an excerpt taken from a subject 
in the written recall condition. . 

2) This appears to be three short stories. In the first story, a little boy and a little girl are playing 
tennis in the backyard. The girl hits the ball first towards the boy , the boy geS it and hits it 
back toward the girl, ... 

Even the two characters ars of different gender, the subject chose to use full NPs to 
refer to each child throughout the retelling of the first episode. The first episode 
accounted for 28% (i.e., 1 14 out of 405) of all intra-episode NPs. 

The third factor to emerge was that in the recall task, the episodes are 
relatively longer and more complex than the imposed episodes in the on-line task and 
speakers were more often obliged to mark minor thematic discontinuity occurred within 

* Due to limited space the examples used in the paper are all rendered in English translation with 
the original anaphors (i.e., NPs. lexical pronouns, and zero anaphora) intact. ""'^"O". wiin 
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the episodes, e.g., changes of scenes, changes of participants, changes of perspectives 
or point of views within each episode. Subjects tended to use NPs to signal these 
changes, i.e., to treat them as indicating sub-episodes in the story structure. For 
example, a subject in the oral recall task produced the following excerpt. 

3) The third story is about the same boy. At the beginning, he is standing on (..) on a chair, 

holding a fly-swatter, about to hit a fly. The fly flies toward a chair (um..) a sofa by the chair. 
He waves the fly-swatter and aims at the sofa. But he misses the fly, and (..) and hits a pile 
of newspaper on the sofa instead. / Up sits a man suddenly from under the newspaper, (uh..) 
perhaps his father. Just as the man sits up, the newspaper is falling on the floor. The boy 
doesn't pay any attention (to the man), (he) rummages about among the newspaper on the 
floor, trying to find the fly. He then throws the newspaper all over the place, ... 

In this episode, when hits the newspaper, something surprising changes the 
perspective of the story. This signals a sub-boimdary or thematic discontinuity (as is 
indicated by a slash in the above recall data). At the beginning of this sub-unit, not only 
the new character is introduced by an NP, the re-introduction of the old character is 
also done by an NP. This type of in' ra-episode NPs accounted for another 33% of all NP 
misses (93 out of 282) in the recall task. 

Finally, in the on-line task, about 29% (i.e., 36 out of 123) of all intra-episode 
NPs were produced by the three subjects, who viewed and described the dual picture 
frame as if the two were presented individually. In other words, they overtly mentioned 
that they treated the single frame as two separate pictures during their production 
task, with phrases like "tiie picture on the left, ... the picture on the right", "the next 
pictxu'e, ... the next picture", "the first picttire, ... the second picture", etc. These 
subjects used NPs to reinstate a referent following their mention of the second picture 
in tJie frame. 

AU three subjects performed consistently throughout the narrative tasks, 
reinstating a referent between the two pictures as if there were a minor boundary 
there. Tomlin (1987) observes similar behavior in his experimental study and finds that 
some subjects overtly treated the dual slide presentation conditions as though the slides 
were presented singly throughout the task. 

The remaining intra-episode NPs may be attributed to idiosyncrasy. The non- 
central characters were normally referenced by NPs within the episode. However, some 
subjects used NPs and pronominals alternatively to disambiguate referents when the 
central character was of the same gender as the non-central referents. This usage, 
however, amounts only to 6% of all NP misses (i.e., 25 out of 405 intra-episode NPs). 



5. Written narratives 

While Section 4 focuses on the distribution of anaphora in both spoken and 
written narrative samples eUcited in the experimental condition, this section will 
examine written Chinese texts and explore the basic pattern of anaphora in popular 
Chinese novels/narratives. We argue on the one hand, that the general principle of 
anaphoric patterning proposed in the present study holds for both written and spoken 
Chinese narratives, and on the other hand, that some differences exist between 
speakers and writers in their use of anaphora because of some distinct characteristics 
of the two modalities. 
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The differences between spoken and written discoiirse have been explored 
since 1960s (Chafe, 1982; Havelock, 1963, 1971; Ong, 1977; Tannen, 1982, 1984; inter 
alia). Some studies have focused on particular differences or sets of related differences, 
and argued that the two modalities differ from each other in more ways than just the 
medium in which they are conveyed. Others have held that the differences between 
speaking and writing can be overridden when the context is appropriate. There are 
some styles of speaking which makes uses of features associated with writing, and 
some styles of writing which are more like speech. Beaman (1984), for example, finds 
that the spoken narratives are just as complex as the written ones: subordinate 
clauses frequently occur in spoken narratives as well, contrary to the findings of the 
previous studies, though they are different types and used for different discourse 
purposes. 

In this study, we consider the alleged structural characteristics of spoken 
and written discourse to be best represented by a continuum. Spontaneous 
conversation and formal academic prose would set up two poles on the continuum, and 
other styles of spoken and written discourse may be posited on various points of the 
continuxim, closer or farther away from the poles. Spoken and written narratives, for 
example, would be close to each other on the continuum, as Tannen (1984) claims: "all 
narratives, spoken or written, is modelled on the oral story-teUing genre" (p. 39) because 
they depend for their effect on interpersonal involvement between the writer or the 
character and the reader. The similar story-telling style, and hence the similar 
structural characteristics of written and spoken narratives would also reveal a similar 
pattern of anaphoric distribution between the two modalities. The experimental study 
discussed in the previous section has given evidence to the prediction. The naturally 
occurring (as versus experimentally elicited) written narratives, on the other hand, 
would yield the same basic pattern of anaphora since writers organize the discoiirse, 
empathize with their audience, utilize pragmatic and contextual information in a similar 
way as do speakers in their narrative production. 

5.1. The hierarchical structure of written narratives 

The present study argues that writers* hierarchical organization of written 
narratives, like spoken ones, governs their us^^ of anaphora to a large extent. The basic 
assumption underlying structural analysis of discourse is that speakers/ writers try to 
produce stories and conversations as separate but interrelated stractural vmits, and 
hearers/readers also try to represent incoming information in a group of hierarchically 
organized units. 

The major difference between spoken and written narratives in this study is 
that the discourse organization in narrative production is a speaker-and-hearer oriented 
process, but the discourse organization in written narratives is mainly reader-oriented. 
This results from different cognitive demands impoe;;d upon speakers and writers. As 
discussed earlier in the paper, the hierarchical organization of discourse is a 
manifestation of the limited capacity of himian cognitive resources: the spoken units, 
usually simple and short, are limited by short-term memory constraints, and also by 
speakers' empathy with hearers' cognitive capacity limitations. Writers, on the other 
hand, are relatively freed from cognitive constraints (as far as the final written product 
is concerned). Their production processes would be little affected by discourse 
orgtm^ization from their own point of view. Nevertheless, writers write for an audience. 
They would also try to organize the overall discourse into sizable, comprehensible vmits 
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of different levels because they know intuitively that language so packaged will be 
easier to process for their readers, who are more constrained than themselves by 
cognitive resources and have to process incoming information without a specific 
discourse plan. To ensure a successful delivery of what they write, writers try to help 
their readers build a discourse representation congruent with their own by forming 
hierarchical structural units along the linear path of discourse production. While lacking 
the opportunity for a direct interchange, writers employ various signaling devices to 
separate and link structural units. The alternative use of NPs and pronominals is one ot 
the signaling devices writers employ to cue comprehenders where and when a new umt 
starts. 

Previous work has been done on the structviral analysis of written story or 
narratives. Among numerous theories dealing with discourse representation and story 
comprehension, the theory of story grammar or story schema is most influential 
(Brown & Yule, 1986; Mandler & Johnson, 1977; Rumelhart, 1977; Thomdyke, 1977). A 
story grammar generally consists of a set of rules that describe how a story can be 
chunked into smaller units such as setting, episode, event, action, goal, consequence, 
etc , and how these units are related to one another. The approach of story grammars 
helps provide ways of representing knowledge stored in memory and how it relates to 
discourse understanding. However, such a story grammar appears to be appropriate 
only for short, simple and specially constructed texts. In analyzing naturally occurring 
narratives, the problems of the story gra mm ar (see especially Thomdyke, 1977, p. 79} 
become apparent: 1) the lower level components such as subgoal, event, attempt, etc. 
are so loosely defined that identifications of such categories in long, complex, natural 
narratives are extremely difficult; 2) it is not at all clear how recursive units such as 
episode and event differ fi-om or relate to one another, and how they relate to the overall 
structure of the narrative; 3) the set of rules defined in the story grammar are either 
too restrictive or too general to account for narrative units of different types. 

The present study proposes, in accord with the hypothesis proposed in Section 
3, that both written and spoken discourses are hierarchically organized into sequences 
of episodes. The general difference foimd in this study between the two modalities, 
however, is that episodes in the latter are simple, short, and similar in length and 
content, whereas episodes in the former are of varying length, complex, and have more 
layers of recursive units. Specifically, four levels of units are identified m wntten 
narratives- the overall discourse, macro-units (episodes), micro-units (subunits), and 
sentences/clauses, rangmg fi:om the highest to the lowest. These lower-level units are 
related to one another to maintain local thematic continuity, while they contribute to 
higher level theme to manage the global coherence of the discourse. Of these structural 
tmits, episodes are regarded as the core unit of discourse because they pertain both to 
global structure of discourse and to topically coherent parts of discourse. Moreover as 
illustrated in our experimental study, episodic organization in discourse processing has 
psychological content and is crucial to anaphoric patterning in written discourse. 

5.2. An analysis of ^vritten texts 

According to the present hypothesis, NPs would be expected to be used at 
episode boundaries to mark the beginning of a new structural unit, and pronominals are 
used within episodes to maintain thematic coherence of the unit. For our text analysis, 
episodes are first identified in written texts, and then the distribution of anaphora is 



examined to see if the written narratives used for the present study exhibit the pattern 
of anaphora predicted by otir hypothesis. 

As defined in Section 3, an episode is recursive in nature and subsumed 
under a macropropositions. Since each episode is subsiuned by a different 
macroproposition, topic changes would be expected to take place at the beginning of a 
new episode. Writers often use subtitles, chapter or section headings, or even blank 
lines to separate episodes and divide boundaries. The beginning of an episode is 
sometimes also cued by time or place phrases such as Friday, March 20; Three days 
after; Outside the restaurant; In the hospital, etc. Writers use these cues to signal the 
advent of a new episode, and readers depend largely on these cues to build separate 
substructures to represent episodes during comprehension. Moreover, NPs (more 
coding materials) would occur at the beginning of an episode to facilitate readers' 
construction of the new substructure since reference would be less accessible to them 
across episode boundaries. 

In the present analysis, episodes are identified roughly corresponding to 
chapters, sections, paragraphs in the written narratives, and episode boundaries are 
usually accompanied by chapter headings, sub-headings, blank lines, and adverbial 
phrases of some kind. NPs would occur in an episode accompanying one or more of the 
following parameters , which is/are employed to signal transitions between episodes. 

1. the first mention of a participant in an episode, and/or changes in 

2. time 

3. location 

4. topic 

6. participant 

In the remainder of this section, we wiU analyze one of the chapters 
randomly chosen fi-om each of the following three contemporary Chinese novels, and 
illustrate the general pattern of anaphora with examples taken from these written 
narratives. We will narrow our focus on human referents only, i.e., examine anaphora in 
its prototypical use-tracking a human participant through a discourse. 

1. The Aged (Cheng, 1991) 

2. The Years that Slipped By (Ye, 1982) 

3. The Leaden Wing (Zhang, 1984) 

The chapter from Cheng is composed of 20 episodes ( 1 1 pages), the chapter 
from Ye consists of 27 episodes (18 pages), and the chapter from Zhang contains 25 
episodes (13 pages). Generally, the hit rate (i.e., NPs used at the boundary and 
pronommals used within episode) is very high for all three chapters. They range from 
92% to 94%, with an average hit rate of about 93%. Specifically, almost 99% of 
pronommals are used within episodes to maintain thematic coherence, only 78% of NPs 
are used at the boundary. Special attention is thus paid to the analysis of the NPs. 

First, of the 78% of NPs occurring at the boundary (Hits), 8% are used for the 
farst mentions of participants in an episode. These can either be the first introductions 
or re-introductions of a referent. The foUowing passage takes the first few clauses from 
each of the three consecutive episodes in the selected chapter by Zhang (1984) All 
ttu-ee episodes focus on a m^jor discourse participant He Jiabin and referred to 
frequently by pronommals within each episode, yet the character is reinstated by an 
NP at tlie beginning of each episode. 
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4) He Jiabin looked sternly, even somewhat gloatingly, into the man's fat, greasy face, ... 
He Jiabin had many things on (his) mind as he made (his) way to Room 213, ... 

He jiabin had just got off work when (he) spotted Wan Qun at the gate, ... 
Secondly, about 12% of NPs are used at the beginning of an episode 
accompanied by adverbial phrases of time. Consider the following passage: 

5) In a large office, Zeng Huixin's desk was placed at an inconspicuous comer. She sat at the 
comer since (she) graduated from university, ... 

After a few years, Zeng Huixin had become a skillful editor. She still sat at the comer, ... 

Third, about 7% of NPs are used to mark thematic discontinuity alter 
changes indicated by adverbial phrases of place. The foUowing example provides an 
example. 

6) She (Xia Zhuyun) was a bit upset, thinking that the hairdressee was over friendly. 

Outside the beauty saloon, Xia Zhuyun took a glance at her watch. ... 

Next, another 16% of NPs are used at the beginning of an episode where a 
change in topic occurs. Changes in topic are of varyuig kind such as a shift from the 
description of one participant to another, a shift from a participant's appearance or 
personalities to authorial comments, or changes from action sequences to a character s 
inner thoughts, etc. An example of this kind is given below. 

7) She (Ye Zhiqiu) herself couldn't quite figure out why she would do it It may be because 
she could never be a mother in her life, (she) would Uy to seize any opportunity to show her 
love as a mother like all women in the world. 

For a woman, ugliness is certainly a misfortune. 

Taking individually, there was nothing wrong with Ye Zhiqiu's features, but these. 

features, viewed as a face, made her one of the few most ugly women. 

Finally, about 35% of the NP hits are used for shifts between two participants 
in a close interaction, especially in a dialogue. Dialogues in written narratives are often 
explicitly cued (about 70% of time) by the characters' names, although some (about 
30% of time) are not cued at all to intensify the effect of making the reader a closer 
onlooker of the progressing events. The foUowing dialogue provides an example of the 
formei ase. 

8) Du Jianchun asked: 'Tell me, how did you c.<^me to this remote area?" 
"I?" Ke Bizhou hesitated, stumbling: "You, you want to know the tmth?" 
"Of course!" Du Jianchun was much surprised: "Who'd like to hear lies?" 

Ke Bizhou was somewhat uneasy, he said dryly: "I came here not of my own free will 
"What!" Du Jianchun cried loud and cut him short. ... 

Although the two characters are of diflFerent gender, the dialogue is still cued by 
proper names rather than pronoims. 

While inter-episode pronominals are negUgible (about 1%), intra-episode NPs 
accounts for about 22% of all NP tokens. Of these NPs misses , about 8% are NPs used 
within episodes to disambiguate referents, where two participants of the same gender 
are involved. However, there still exists a pattern of referential choice between the two 
characters. Lexical pronouns are generally used to refer to the cvurently more topical 
character (from whose point of view the passage is oriented), while NPs are used for the 
less topical character. For example,. 
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10) Shi Quanqing considered He Jiahin stupid. He had worked with liahin for many years. 
During all those years, he had watched He stumble time and again (politically), he had 
spotted every obstacle in He's way, but he had never once alerted U& of the danger; he 
couldn't wait to see He fall flat on (his) face. 

Another 6%ofNP8 are used within episodes to mark perspective or point of 
view changes of the author and/or discourse participants. This is illustrated by the 
following passage. 

11) Her (Zeng Huixin) talent was recognized by the group. Some famous writers' work were 
put on her desk for translation, and some hard-to-translate phrases and sentences were also 
sent to her for solution. She was like Cinderella discovered. Even her reticence made her 
more attractive than ever. 

But Zeng Huixin was still single, nor had (she) got a boy-friend. She was by nature 
proud and aloof, ... 

Here in this passage, the perspective changes from how the character is evaluated 
by other people to how the character is by herself. At this transition point, an NP is 
used to indicate the change. 

To svumnanze, in the written narratives analyzed above, NPs are used at the 
beginning of an episode (together with other cues) to trigger readers to shift and initiate 
new substructures so that they can represent each episode in its own substructure, 
and they are sometimes also used within episodes to mark perspective or point of view 
changes, or resolve referential ambiguity between discourse participants. Altogether, 
these usages account for 92% of all NP tokens found in the three chapters selected for 
the analysis. 



6. General Discussion and Conclusion 

The present study proposed a model of anaphoric choice in which cognitive 
constraints, discourse organization, and pragmaticAnterpersonal factors interact to 
control a speaker/writer's referential decision during discourse processing. The model 
not only illustrates the general rule of anaphoric patterning in narrative production, but 
also predicts alternative uses of anaphora at specific places. 

The findings support the general hypothesis proposed in the present study. 
First, the experimental results demonstrate that episodic organization of narrative 
production has psychological content: the story was hierarchically organized and 
remembered as a series of episodes. The psychological reaJity of episodes provides a 
soimd foundation for the episode theory explored here. Second, the episodic structure of 
discourse largely governs a speaker/writer's anaphoric choice. Both the experimental 
results and the text analysis show that speaker/writers are sensitive to episode 
boimdaries. They use more marking materials (NPs) at episode boundaries where more 
cognitive resources are demanded, and they use less marking materials (pronominals) 
within episodes where thematic coherence is maintained. Third, the thematic 
discontinuity within episodes such as change of perspectives, possible worlds, and 
ambiguity resolution also demands more coding materials. Besides the general 
characteristics of anaphoric distribution in both spoken and written narratives, 
dififerences also exist between the on-line oral production of stories and written 
narratives. First, the dififerent cognitive demands imposed on speaking and writing 
makes a writer's discourse organization (and hence anaphoric choice) even more 
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audience-oriented. While the speakers' use of anaphora was a manifestation of both 
their own cognitive processes and an assessment of their hearers' current knowledge, 
the writers' major concern was to help readers build a hierarchical representation of 
discourse congruent with their own by their patterning of anaphora. Second, the 
episodic structure in the written narratives was more complex, recursive than that in 
the oral story-telling, and the writers therefore were more likely to employ NPs to 
create subboundaries for their readers to facilitate comprehension. Third, while the 
speakers consistently pronominalized human central characters of the story in the 
relatively simple oral production task, writers' view of central/topical characters 
changed from episode to episode. Nevertheless, it was still the currently topical referent 
who received attention and remained in focus, and was hence more likely to get 
pronominalized. 

The present study gives further evidence that while stories and texts may be 
presented or produced in a linear fashion, they are nevertheless formulated and 
processed hierarchically. This hierarchical organization of discourse is constrained in 
part by the cognitive processes of memory and attention. In this process, the episode 
serves as a basic unit in production as well as in comprehension. The alternative use of 
NPs and pronominals is a very important device to represent discourse structure in 
production and facilitates the restructuring of discourse representation in 
comprehension. The correlation between discourse organization and anaphoric 
patterning has provided an informative method of investigating the relationship 
between language and cognition. 
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1.0. Introduction. The study of tone has figured prominently in Sino-Tibetan linguistics, for a 
very good reason: the majority of the extant languages in this family' make distinctive use of pitch- 
related phenomena of one type or another. Considerable progress has been made in recent 
phonological investigations into the tonology of Sinitic languages (e.g. Yip 1980, Yue-Hashimoto 
1987, Shih 1986, Bao 1990). Comparable studies on the highly diversified and relatively under- 
explored Tibeto-Burman languages are however still scanty, with a few outstanding exceptions such 
as Mazaudon 1977, Michailovsky 1988, and Weidert 1987. The present study intends to contribute 
to a better understanding of the typology of tone in Tibetan, one of the principal languages of the 
Tibeto-Burman family. 

An overview of the attested types of tonality in modern Tibetan in §1 puts the paper in 
perspective; the particular tone system of Lhasa, representing a relatively advanced tonogenetic stage, 
is then briefly described. In §2 Tibetan tonology is explored from the vantage point of 
autosegmental phonology, a framework which holds special promise in elucidating tone in Tibetan. 
The particular autosegmental account of tonal phenomena in four Tibetan dialects given in Duanmu 
1992 is critically examined in §2.1; a more comprehensive and explanatory reanalysis is offered in 
§2.2 which diverges from the foregoing with respect to (i) the representation of the underlying tones, 
(ii) the source of the redundant high tone on non-initial syllables, and (iii) the role of tone-spreading 
in Tibetan tonology. Next, two issues involved in the proposed analysis are further explored, 
bearing respectively on the high tone as the 'default' tone in Tibetan (§3.1), and the problem of 
whether word-level melody is derived from syllable tones through 'tone sandhi' (§3.2). Baied on 
the findings of this paper, a typological distinction is suggested in the concluding section between 
template word-tone languages represented by Tibetan and Dongkou Chinese, and contoureme 
word-tone languages represented by Tamang and New Shanghai Chinese. 



* 

The main idea.s in this paper have been presented in a linguistics colloquium at the Institute of History and 
Philology (May 93) and at an invited talk delivered at the Graduate Institute of Linguistics, National Tsing Hua 
University (June 1993). Thanks are due to Randy J. LaPolla, Duanmu San, C!iiu-yu Tseng, Dah-an Ho. Pei-chuan 
Wei, Chin-fa Lien, and Kuang Mei for their constructive queries and comments on the data and analysis presented 
herein. 

'tHc Sino-Tibetan language family contains at least two subfamilies, Sinitic (Chinese) and Tibeto-Burman. 
According to the more conservative and increasingly popular view in the field, the Miao-Yao and Tai-Kadai 
languages are not genetically related to Sino-Tibetan. 
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1.1. Tonality in Tibetan. Tonality is generally speaking under-developed in Tibetan, as in some 
other Tibeto-Burman languages.^ It is generally held that Old Tibetan, was not a tone language, in 
view of the complete absence of tone-marking in the traditional Tibetan script dating from the 
seventh century, and a fortiori in view of the existence of modem dialects which remain atonal to this 
day. Modem Tibetan, on the other hand, presents such a variegated scenario of tonal developments 
that the simple dichotomy of 'tonal' versus 'atonal' dialects seems insufficient. It would be more 
realistic to plot modem Tibetan dialects along a scale of increasing tonality, ranging from completely 
atonal to relatively highly tonal as exemplified in Table 1 below, based in part on Huang 1993: 



Tonality Scale 


Description of Each Stage 


Representative 
Dialect Points 


Atonal 
A 

Y 
Tonal 


neither phonemic tone nor 
redundant 'habitual' tone 


Ndzorge; Ngaba 


no phonemic tone; 
redundant 'habitual' tone 
developed 


Labrang; Daofu 


tone phonemic in restricted 
environments only 


Amdo Sherpa; Balti 


tone generally phonemic; 
tone values unstable in some 
syllable types 


Derge; Yushu 


tone values stable; high 
redundancy 


Lhasa; Gar 


additional contrast between 
falling and level contours 
established 


Shigatse; Dzongkha 



Table 1: The Tibetan Tonality Continuum 



At one end of the above scale are found dialects in which all syllable types carry a high 
(falling) tone when uttered in isolation, much as in English. This is, of course, the completely 
atonal stage, represented by such Amdo dialects as Ndzorge (mDzod-dge; Sun 1986),-' Amchog (a- 
mchog; Wu 1983), and Ngaba (rNga-ba; Huang 1993); old Tibetan, in all likelihood, also belongs to 
this type. The next stage is marked by the genesis of 'habitual tone' (Hu 1980: 31) or 'natural 
tone' (Huang 1993), i.e. fixed redundant pitch patterns determined by the voicing state of syllable 
initials, with voiced initials conditioning low pitch and voiceless ones conditioning high pitch, in 
such Amdo dialects as Labrang (bLa-brang; Hua 1980:72, Hu 1980:fn. 20) and Daofu (rTa'u; 
Huang 1993).'* Tone, however, did not become contrastive until the emergent-tone stage where a 
limited number of tonally distinguished minimal pairs began to enter the scene. Two subtypes 
exemplifying this stage can be identified; contrastive tones are either restricted to certain syllable 



Chinese-like, or omnisyllabic (Matisoff 1991 :491 ) tone systems where all syllables normally carry contrastive tone 
arc lacking in many Tibeto-Burman branches, such as Tani, West Himalayish, Bodo-Garo, and Bodic (including 
Tibetan). 
3 

Written Tibetan (hereafter WT) forms will be given in Wylie's standard system of transliteration. 

"^Hence the slogan /<[ ^T, >)5 i6. 'High pitch if voiceless; low pitch if voiced'. It is often implied that this slogan 
can be applied to all Amdo dialects (Hu 1980: 31 ; fn. 20; Hua: 1980:72-3), and even to Old Tibetan also (Hu 1980: 
31). One of the important contributions of Huang 1993 is to dismi.ss this misconception by pointing to the existence 
of both types of Amdo dialects (e.g. Ngaba vs. Daofu). See also §3.1 below. 



ERIC 



-354- 



types, such as those with nasal initials in tl.e case of Amdo Sherpa (a-mdo Shar-pa; Nagano 1980), 
or apply only to disyllabic and trisyllabic nouns as is the case in Balti (sBal-ti; Sprigg 1966: 186- 
9).^ Yushu (Yus-hru'u), along with such other varieties of Khams Tibetan as Derge (sDe-dge; Qu 
1979:121; Huang 1993: 3) and Chamdo (Chab-mdo; Liu 1984), embody the next stage of tone 
development, with distinctive tones on most syllable types but variable and hence non-distinctive 
pitch patterns on others (see section below). Then came the stage represented by Lhasa as well as 
many other varieties of tonal Tibetan, where contrastive tones have permeated to all syllable canons 
but such tones contain a high degree of redundancy, being multiply realized by such features as 
phonation type, final glottality, tensity, syllable quantity, as well as pitch. The niost advanced 
tonogenetic stage in Tibetan is reached by such dialects as Shigatse and Dzongkha, where a new 
distinctive (steep) falling pitch arises in compensation for the apocopated glottal coda in the case of 
Shigatse (Qu 1981a:186-7; Huang 1993) or sonorant-coda apocopy as well as syllable contraction 
in the case of Dzongkha (rDzorig-skad; Mazaudon and Michailovsky 1988), making it necessary to 
recognize both a register (high vs. low) and an intersecting contour (falling vs. level) contrast. 

One of the most important generalizations on Tibetan tone, even in its most advanced state, is 
that the primary register contrast is realized only on the initial syllable of a phonological word; all 
other syllables are normally on predictable high register.^ The drastic reduction of tone in 
multisyllabic domains results in at most one contrastive tone per (phonological) word m 
Tibetan, regardless of the number of constituent syllables. On account of this fact, there is now 
growing consensus that what (tonal) Tibetan has is a word-based rather than syllable-based tone 
system (Sprigg 1954, 1955; Mazaudon 1977; Ossorio 1982:2.5.6; Shih 1986: §4.5). 

1.2. Tone in Lhasa Tibetan. Although Lhasa is the best-known variety of modern Tibetan, some 
areas in Lhasa pijonology, in particular its tone system, remain controversial. A number of factors 
are responsible for this lack of consensus. First, not all sources on alleged 'Lhasa Tibetan' 
represent genuine samples of the native speech of the Lhasa city. Second, elicitation methods which 
make no provision for the pronounced stylistic differences in Tibetan may yield controversial results 
(Sprigg 1992, 1993). Moreover, how one should properly handle multiple phonetic realizations of 
tone and tonal neutralization in non-initial syllables mentioned above contributes further to divergent 
interpretations of Lhasa tonology. 

To begin with, examine the following table of the citation pitch patterns of Lhasa 
monosyllables reported in Hu 1980 and Hu et al. 1982, based on an instrumental study of the 
colloquial-style pronunciation of three native speakers:^ 



^Huang 1993:2 reports a few minimal pairs on monosyllables also in her Balti consultant's speech. 

^Also to be included in this type are such other varieties of Central Tibetan as Langkazi (sNang-dkar-rtse) (Qu 

1981a), and Shap (Ossorio 1982). 

^Except unstressed clitic syllables and a minor case to be discussed in §3.2.2. 

^Pitch patterns are given in the familiar numerical tone notation (highest pitch level = 5; lowest pitch level =1). See 
also the instrumental study reported in Kjellin 1977, which yielded comparable results. However, Sprigg 1993 argues 
against the citation-form approach, warning that literate Tibetans may give spelling-style pronunciations when 
uttering syllables in isolation. However. I have had quite different personal experiences working with my literate 
Amdo Tibetan consultant, who. keen on the stylistic differences, has no difficulty whatsoever enunciating citation 
forms in the colloquial-style on demand (sec Sun 1986: Chapter 4). It would be only fair to point out that the 
linguists conducting the experimental study reported in Hu et al. 1982 were also fully aware of stylistic distinctions 
in Lhasa Tibetan, and explicitly states: 'this experiment was based entirely on the colloquial pronunciation.. .as natural 
in fluency and tempo as in normal daily conversation as possible...' (Hu et al. 1982:23. translation mine). 
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Register 


WT Form 


Lhasa Form 


Pitch Pattern 


Gloss 


mcH 


bka' 




high (slight) falling 


'decree' 


ka-ba 


ka:55 


high level 


'pillar' 


bkag 


ka?52 


high (steep) falling 


'hinder' 


skam 


kaim^s 


hijh level 


'be dry' 


bskams 


kam?52 


high (steep) falling 


be dry (perfective)' 


LOW 


sga 


kai2 


low (slight) rising 


'saddle' 


bsgar 


ka:"3 


low level-rising 


'fasten' 


'gag 


ka?i32 


low rising-falling 


'be clogged' 


gam 


ka:m"3 


low level-rising 


'box' 


'gams 




low rising-falling 


'put in mouth 
(perfective)' 



Table 2. Lhasa Monosyllabic Citation Pitches 

Several observations can be made about the preceding data: 



(1) 



a. 

b. 
c. 
d. 



High-register syllables are characterized by a fall in pitch, and low-register syllables by a 
rise in pitch. 

On long syllables, pitch movements are flattened. 
The glottal stop coda -? induces a steep drop in pitch. 

There is at most a two-way register contrast, high versus low, on any of the five rhyme 
types in Lhasa (-V, -VV, -VP, -VM, and -VMP; where V = vocalic nucleus; P = stop 
coda, including the glottal stop -?; M = non-checked or sonorant coda). 

The six complementarily distributed pitch patterns in Lhasa, therefore, leave much room for 
different tonemic interpretations,^ four of which are summarized below (Hu 1980:23-4): 

(2) Four-tone analysis A:'" Marks glottal stop; regards syllable quantity as an inherent 
feature of tone (speaking thus of 'long tones' vs. 'short tones'): 



ka54 

ka:^^ 
ka?52 

kaim^s 
kam?52 



•> ka53 
•> ka^s 
•>ka?53 



'decree' kai2 

'pillar' ka:"3 

'hinder' ka'Pisz 

>kam55 'be dry' ka:m"3 

> kan'PSS'be dry (pf)' kam'Pisz 



-> ka^s 
->kai5 



'saddle^ 
'fasten' 
>j^a?35 'be clogged' 
>kami5 'box' 

>kam'?35 'put into mouth (pf)' 



(3) Four-tone analysis B: Represents syllable quantity .«=egmentally (quantity in syllables 
closed by sonorant codas are not marked); gives falling tones tonemic status (tone marks: -f = high 
level; -h = high falling; -v = low rising; -w = low rising-falling): 



^Thus. the statement that 'Lhasa Tibetan has six citation tones.' (Shih 1986: 19) is valid only at the phonetic level. 

This is the system used by Tibetologists affiliated with the Chinese Academy of Social Sciences, such as Hu Tan 
Qu Aitang, Zhang Jichuan, and Tang Kerang. . ^u^n a.s nn lan. 

This is the system devised and used by Tibetologists from the Central University of Nationalities in Beiiinc 
Kitamura and Nagano 1990 adopts a similar transcription system for L hasa Tibetan whic-h. however is word based 
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-> gaf 


'decree' 


kai2 


-> gav 


'saddle' 


kai^^ 


-> gaaf 


pillar 


»113 


-> gaav 


raSien 


Ka i'^'" 


-> gaxi 


ninuer 


V-Q 0132 


^ fTQ T.T 

-> y OL W 




kaxm^s 


gamf 


'be dry' 


ka:ia^^3 


-> gamv 


'box' 


kam?52 


-> gamh 


'be dry (pf)' 


kam?i32 


-> gamw 


'put into mouth (pf)' 



(4) Four-tone analysis C:'^ Represents length segmentally; gives falling pitch tonemic status. 
Unlike analysis B, this system recognizes only two register tones on short syllables, but posits an 
additional falling pitch in combination with the two registers on long syllables,'"' yielding four tones: 
high-high, low-low, high-falling, and low-falling. 



ka^i -> qa 


'decree' 


kai2 


-> qa 


'sari flip' 


kax^s -> qaa 


'pillar' 


kax"3 


-> qaa 


'fasten' 


ka^s^ _> qgg^ 


'hinder' 


ka'Pi32 


-> qaa 


'be clogged' 


kaxm^s -> qam 


'be dry' 


kaxmii3 


-> qam 


'box' 


kam'JS^ _> 


'be dry (pf)' 


kam?i32 


-> qam 


'put into mouth (pf)' 


(5) Two-tone analysis: 


''^ Marks both glottality and quantity 


segmentally; regards 


registers as proper features of tone. 








kasi ->ka 


'decree' 


kai2 


-> ka 


'saddle' 


kax^s ->kaa 


'pillar' 


kaxii3 


-> kaa 


'fasten' 


ka?52 .>]^a? 


'hinder' 


ka?i32 


-> ka? 


'be clogged' 


kaxm^s -> kam 


'be dry' 


kaxm^^3 


-> kam 


'box' 


kam'?^^ _> ]^am'? 


'be dry (pf)' 


kam? 132 


-> kam? 


'put into mouth (pf)' 



Most phonologically defined words in Lhasa Tibetan are more than one syllable long. They 
include, in the main, nominal and verbal stems plus their corresponding endings, and disyllabic (and 
sometimes trisyllabic) compounds. Phonological words are characterized by a number of internal 
sandhi phenomena such as presence of certain medial 'intrusive' consonants,'"'' vowel harmony, 
deaspiration of stop/affricate initials, voicing of second-syllable voiceless sonorant initials, and above 
all, tonal modulations.'^ Table 3 below lists the six surface pitch patterns pronounced in isolation 



1 2 

This system, designed by Chang Kun and Betty Shefts Chang (Chang and Shefts 1964; Chang and Chane) and 
adopt;; n a number of influential teaching materials on Lhasa Tibetan by John Goldstein, is by far the best-known 
systfi'i n use outside of China. 

' ■^Sy.'liibles with the glottal stop coda are represented in this system as long syllables. This has to do with the fact 
that -'7'? is often realized as -VV in the first syllable in multisyllablic words in Lhasa (Qu 1981a:191-2). 
Moreover, according to Hu 1980; fn. 13, some Lhasa speakers pronounce all glottal-coda syllables as long open ones 
(Rinzin Wangpo, R. K. Sprigg's main Lhasa Tibetan consultant, is one such speaker). 

'"^This is the two-tone analysis of Lhasa Tibetan advocated in this paper. 

'^These are the remnants of Old Tibetan consonant clusters, e.g. in the Lhasa word mezita 'firearm' (< me Tire' + 
ta 'arrow'), the medial nasal -n- is a reflex of the nasal preradical in- of the second morpheme ta (< WT mcla') 
See Ossorio 1982 5.1 .4; Sun 1986; 4.4 for more det;..!s. 

"^Thcsc are the phonetic exponents of the intcrvcrbal junction (i.e. close juncture) prosody (Sprigg 1954: 146-9).. 
For a different set of sandhi devices in the atonal Ndzorge Shseme Kara dialect, see Sun 1986: Chapters 3 & 4. 
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and the respective modulated pitch shapes when these occur in the first, medial, and final syllables in 
mulUsyllabic words'"' (based on Hu 1980): 



Monosyllabic Tone 
Value 


Tone Value in Multisyllabic Words 


First Svllahle 


Medial *svllflhlp 

(if any) 


"lllcu oyilaUlC 


54 


55 


55 


54 


55 


55 


55 


55 


52 


55 


55 


52 


12 


11 


55 


54 


113 


11 


55 


55 


132 


11 


55 


52 



Table 3. Lhasa Pitch Patterns in Multisyllabic Words 



Again, several observations can be made: 

(6) a. In the first syllable, the characteristic fall and rise in pitch associated respectively with 
high- and low-register syllables are not observed; the attested pitches are level in both 
cases (high level and low level "). 

b. In the medial syllable, if any, the pitch is always high level^^. 

c. In the final syllable, only high-register pitch patterns are found. 

Thus, multisyllabic words in Lhasa, as in most other tonal dialects, carry a two-way contrast 
only in the first syllable, all subsequent syllables are predictably high-pitched. Obviously, the tone 
system at work in Tibetan is a highly restricted one, differing in fundamental ways from a typical 
Sinitic system (see §4 below). 

2.0. The Autosegmental Approach to Tibetan Tonology. Autosegmental phonology, an 
offshoot of non-linear phonology developed by John Goldsmith and others (Goldsmith 1979, 1990; 
Yip 1980), is the descriptive model adopted in a recent article by Duanmu San on the analysis of 
tone in modern Tibetan (Duanmu 1992), where it is contended that an autosegmental approach to 
Tibetan tone can bring out insights missed by the traditional, syllable-based approach. 

Indeed, autosegmental phonology seems a particularly fitting framework for the insightful 
treatment of tone in Tibetan, given the characteristics of Tibetan tone mentioned above. First, it is the 
contrast in (high vs. low) pitch register that is fundamental in Tibetan, whereas pitch contours are 
often redundantly associated with suprasegmental parameters Thus, a high tone in Lhasa falls 
slightly, stays level, or falls steeply depending on whether the tone-bearing syllable is short, long, or 
checked/glottalized. Dialects also differ significantly with respect to how surface pitch contours are 
realized (see Table 5. below). As has been amply demonstrated in previous work on African tone 



Multisyllabic words in Tibetan are at most three syllables long. Quadrasyllahic expressions in Tibetan behave 
tonally as combinations of two disyllabic words (Qu 1981b:21). In the Lhasa .sy.stem. the pitch of a long second 
syllable is rising rather than level if the first syllable contains a low tone in multisyllabic words. Moreover, 
unstressed syllables also behave differently (see §2.2 below). 

'^Subsequent reference to this source will be by data-set number and page number only. 
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languages, the autosegmental approach is a particularly fitting framework for handling register-tone 
systems. Moreover, autosegmental phonology allows both general and localized tone processes in 
Tibetan to be characterized in a revealing way. Consider for example the pervasive reduction of tone 
in Tibetan non-initial syllables, resulting in highly restricted tone patterns in multisyllabic words. 
Instead of exhaustively listing individual pairs of citation tones and the respective 'sandhi tones' as 
is done in the traditional approach, an autosegmental analysis can reflect the simplicity of the 
underlying tonal neutralization process by formulating a simple unitary tone rule which nullifies the 
underlying tones on non-initial syllables, leaving only the initial tone to bear the tonal contrast of the 
entire word (see §2.1 and §2.2 below). 

Concurring with Duanmu's general points on the usefulness of the autosegmental model in 
representing tonal oppositions and processes in Tibetan, we nevertheless hold rather different views 
regarding what constitutes adequate autosegmental treatments of Tibetan tone, for reasons we shall 
see below. 

2.1. Duanmu's Analysis. In Duanmu 1992, tonal variations in monosyllabic and multisyllabic 
expressions in four Tibetan dialects, Lhasa, Zedang (rTsed-thang), Gar (sGar) and Gaize (sGer- 
rtse),'^ are examined in order to show that tone in Tibetan behaves in ways similar to tone in other 
parts of the world, such as Africa and east China (Wu dialects of Sinitic), in that (i) contour tones 
are made of level tones, and (ii) tones lie on an independent tier and may spread across segments. 

For Duanmu, all four varieties of Tibetan have the same system of underlying syllable-tones: 
a high (H) and a rise (LH), differing from each other mainly in their tone-mapping rules. The set of 
tone rules proposed by Duanmu for Lhasa are repeated as (7) below: 

(7) a. Delete tones from non-initial syllables. 

b. Associate tone to syllables one-to-one, left to right. 

c. If there are more syllables, spread the last tone to excess syllables. 

d. If there are more tones, link excess tones to the last syllable. 

e. If a L precedes a final long syllable with a H, spread L to the latter. 



The relations between the underlying tones and their realizations in different syllable types 
are as in Table 4 below (adapted from Duanmu op. cit.: 75): 





Underlying Tone 


Syllable Type 


H 


LH 


Realization 


54 


12 


-V (open syllable) 


55 


113 


-VVV -VM (long syllables, 
including long open syllables 
and closed unchecked 
syllables 


52 


132 


-VP (checked syllables)^" 



Table 4. Lhasa Surface Pitch Patterns and Underlying 
Tones in Duanmu's Analysis 



'^UiHsu, Zedang, and Gar are Central (dBus-gTsanp) dialects, whereas Gaize belongs to the heterogeneous Khams 
dialect group, according to Qu and Tan 1983. 

^^Not mentioned is the rhyme type -VM?, which, in contrast with -VM, behaves tonally as a short checked syllable. 
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Consider now the sample derivations of the monosyllable ka:"3 'fasten; install', and the 
disyllabic compound me"po:"3 'coal-pan' (composed ofmei2 'fire' andplio:"^ 'bowl'): 

(8) 7b 7d 

ka: > ka: > karris 

/ /\ 
LH LH LH 

(9) 7a 7b 7e 

me-po: > me-po: -> me-po: > meii-po:ii3 

LH H LH L H I 

Only two aspects of the multisyllabic tone patterns in the three non-Lhasa tone systems 
where they diverge from the Lhasa system, are treated by Duanmu. For one thing. Gar and Gaize,' 
unlike Lhasa, lack the tone-spread rule 7e. Contrast (9) above with (10) below, showing the 
derivation of the compound for 'coal-pan' in Gar: 

(10) 7a 7b 

me-po:r > me-po:r > me^i-poirss 

LH H LH L H 

The second divergent pattern pertains to trisyllabic compounds in Zedang, where the medial 
tone, rather than being invariably high-toned as in the other dialects, becomes low if the tone of the 
first syllable is low. This fundamentally assimilatory process is accounted for by appealing to 
'edge-in association', expressed as (b) and (c) of the following Zedang tone rules (p. 83): 

(11) a. Delete tones from non-initial syllables. 

b. Associate the first tone to the first syllable, and the last tone to the last syllable. 

c. If there are free tone in between, spread the first tone to them. 

Consider the sample derivation for the compound word for 'cadre' below: 

(12) 11a lib lie 

le-tpe-pa -— > le-tpe-pa — > le-tpe-pa — -> leii-tpeii-pa^s 



LH LH H LH L H L 



H 



er|c 



Ingenious as it may seem, Duanmu's analysis of Tibetan tone falls short of being 
completely satisfactory. On the one hand, what he advocates for Tibetan is a typologically odd 
system ot underlying tones. Given a two-tone system, it is in principle far more natural to have a 
simple contrast of high vs. low registers than a mixed system of level (H) vs. contour (LH) tones 
especially in view of the high variability of pitch contours in tonal Tibetan (Sprigg 1993).2' 

1990 '|c)S['^y^r' rl'T^?^ underscored the fact that Tibetan is a register tone system (see for example Sprigg 
u;0 U93). Y. R. Chao dso recognizes the two Lhasa tonemes as a basic contrast of high vs. low registers even 
though he describes their actual citation values as contour tones (respectively high falling 53 and low rise-fali 131) 

Snnif^lTn 'mm'M^^' O^hcr Tibeto-Bumian languages with similar two-term register tone systems include 
PaTani (Sa.xena 1991 ), Apatani (Weidert 1987: §6.2) and Manipuri (= Meithei; Chelliah 1991 ). 
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We noted earlier that in Lhasa (as well as in many other Central Tibetan dialects), high- 
register syllables are characterized by a fall in pitch, and low-register syllables by a rise in pitch 
when uttered in isolation. To accord the pitch rise associated with the low tone underlying status, i.e. 
LH, while relegating the pitch fall associated with the high tone to 'domain-final intonation', 
representing the later simply as H, seems rather contrived. The correlation between underlying and 
surface tones would be much more consistent if both the pitch fall (with high-register syllables) 
and the pitch rise (with low-register syllables) are regarded as low-level domain-final phenomena, to 
be dealt with unilbrmly by langut'.ge-specific a'lotonic rules. 

In fact, this heterogeneous system of underlying tones is extended by Duanmu not only to 
Zedang and Gar whose surface tonal phonetics is akin to that of Lhasa, but even to Gaize, which has 
entirely disparate monosyllabic pitch patterns, as shown below: 



H 


LH 


Syllable Type 


53 


31 


short 


51 


22 


long 



Table 5. Gaize Underlying and Surface Tones in Duanmu's Analysis 



In Table 5, the high tone on long syllables is a steep falling tone while the low tones do not rise at all 
in Gaize, contrary to what the proposed underlying tones H and LH indicate. The obvious mismatch 
between the underlying and surface tones is dealt with by Duanmu by adding a patch-up rule which 
tags a L to the right of monosyllables ((38d), p. 81), and, in the case of the low-toned syllables, 
stipulating further that a H tone sandwiched between two L's may 'stay unlinked'; for instance ((39 
b.), p. 81): 

(13) Tone Association L-insertion 

tar ~> tar > tar 22 

1 A 

LH LH L(H)L 

It is evident now that Duanmu pays a high cost in descriptive naturalness and plausibility for 
treating the Tibetan low-register tone as underlyingly LH. This decision is presumably motivated by 
an important fact related to tonal phonotactics in Tibetan, which, we recall, is that in non-initial 
syllables of multisyllabic words the tonal contrast is neutralized to a relatively high-pitch tone in all 
tonal dialects of Tibetan so far recorded. Under Duanmu's analysis, this state of affairs is 
accounted for by attributing the non-initial high-tone to a H emanated from both underlying tones 
on the initial syllable through left-to-right tone-spreading. 

There are, however, indications that this conception of the origin of the (redundant) nor- 
initial high tone is misguided. First, the requirement that all Underlying tones have a H on the right 
edge demands in effect that all varieties of tonal Tibetan have only two kinds of underlying tones, 
high level (H) and rising (LH). This stipulation flies in the face of such surface pitch patterns as 
falling (HL) in the high-register as well as level (L) and falling (HL) in the low register actually 
attested in many modern dialects (Qu 1988: 327). We have seen in the above how much 
Procrustean stretching has to be exercised in order to fit the data into Duanmu's theory of 
underlying tones in the case of Gaize; even more ad-hoc manipulation will have to be performed if 
other dialects are taken into consideration. Second, treating the low-register tone as LH misses the 
underlying unity of certain tonal processes. Consider again the pitch patterns me^-poi^^^ 'coal- 
pan' in Lhasa and le^^-tpe^^-pa^^ 'cadre' in Zedang, where the second syllables become 
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respectively rising and low level when abutting a low-register tone in the first syllable. Intuitively, 
what is clearly at work here is the low pitch of the initial syllable permeating, to different degrees, the 
neighboring syllable. However, the relatedness between these two cases of low-tone assimilation 
is obscured in Duanmu's analysis, which handles them by distinct tone-association rules. On the 
other hand, by representing the low-register tone simply as L directly captures the underlying 
uniformity of these two tonal processes (see below). Furthermore, the representation of the low tone 
as underlyingly LH yields predicted tonal outputs directly contradicted by actual tone patterns. 
Observe, for example, the following representations of the morpheme maa 'butter < WT mar ' both 
in isolation and in the compound tp^a-maa 'tea and butter < WT ja-mar ' under Duanmu's 
analysis: 



Since both occurrences of maa 'butter' bear the same LH tones, the prediction is that their surface 
tones should also be identical. On the contrary, the instrumental research conducted by Hu et al. 
(1982: 34) reveals that the morpheme for 'butter' has a higher general pitch (24) in the compound 
'tea and butter' than its citation pitch ("3). If, on the other hand, the underlying citation tone of 
'butter' is posited simply as L, then the two occurrences of 'butter' will have distinct tone 
structures, L vs. LH: 

(15) ma: tpl^a-ma: 



Crucially, the presence of a H tone in the word-internal occurrence of the morpheme for 'butter' (on 
the provenance of this H tone, see below) provides a natural explanation why the pitch gets 
heightened in this particular environment. This provides further, and in our opinion clinching, 
evidence that the underlying low-register tone iti Tibetan should be no more complex than L, and 
that the high pitch of non-initial syllables in Tibetan is by no means inherited from the initial 
syllable. 

2.2. Alternative Analysis. From the above arguments, and also in compliance with the insights 
distilled from a long tradition of Tibetan tonal research (Jaschke 1881: xiii-xxi, Chao and Yu 1930, 
Miller 1955, Mazaudon: 1977: §3.1; Sprigg 1954 through 1993), it seems clear that the underlying 
tonal representations in Tibetan should be none other than H(igh) vs. L(ow) registers.^^ The 
observed pitch contours which appear on the last syllable of phonological words, on the other hand, 
differ from dialect to dialect and may vary from one phonological or sociolinguistic context to 
another even within the same dialect (Sprigg 1993). Such largely predictable domain-final contours. 



""^At least tor those dialects (e.g. Lhasa) where the (steep) falling pitch can he consistently iJerived from the presence 
ot the glottal-stop coda by an automatic allotonic process. It is only in such dialects as Shigatse (Qu 1981a. 1988, 
Huang 1993) and Shap Tsang (Ossorio 1982) where the loss of the glottal stop makes the steep pitch fall no longer 
predictable from the segmental structure, will it be justified to recognize both a register (high vs. low) and a contour 
(level vs. falling) distinction (for an autosegmental representation of tone in such dialects, see §4 below). 



(14) ma: 



tphe-ma: 
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-ather than being represented underlyingly, should be generated by dialect-specific detail rules, as 
ar gued above.'^-' 

Also at variance with Duanmu's analysis is our account of tne redundant high register on 
non-mitial syllables in Tibetan multisyllablic words. We contend, following a well-known principle 
in markedness theory, that the high register is the unmarked register in Tibetan since this is the value 
found in contexts uf neutralization (Greenberg 1966: 13-24). The high tone in Tibetan non- 
initial syllables, in other words, results not from assimilation to a H in the initial syllable but rather 
from phonological neutralization reducing the original tonal contrast to a redundant high register. In 
our analysis, this generalization is conveyed by a rule which replaces the original tones on non-initial 
tone-bearing syllables with the default register value H.^^ The tone rules for Lhasa Tibetan can 
now be given as ( 16) below: 

(16) a Tone Association (TA): equivalent to Duanmu's tone rules (19b), (I9c), and (19d). 

b. Low Tone Assimilation (LTA): If the tone of the initial syllable is L and the second 
syllable is long, spread L to the latter. 

c. Default Tone Replacement (DTR): Replace underlying tones on non-initial tone- 
bearing syllables with the default high tone (symbolized herein as boldfaced H). 

At this juncture, some lemarks are in order concerning two areas of Lhasa lonology not 
touched upon in Duanmu 1992. First, many grammatical elements such as case markers, verbal 
endings and sentence-final illocutionary particles behave as unstressed toneless enclitics in Tibetan 
(Qu 1981b: 20; W fing 1984). Such enclitics, for example the perfective aspect marker -p»-, are 
extrametrical in tu°t the host syllables they are attached £o are characterized by domain-final 
contours, as if the encliucs do not count, as part of the tonal domain (Qu 1981b: 20; Mazaudon 
1977-82-3; Durand 1990:21 1-5). Furthermore, DTk also applies vacuously to toneless syllables 
which do not possess corresponding slots on the tone tier to serve as landing sites of the default 
hi"h tone 25 Toneless svllables are to be distinguished from cases like the imperfeclive aspect 
marker -ki- in Lhasa which, being bound morphemes, never occur by themselves in natural speech 
and therefore are lexically unspecified for tone. Unlike toneless syiiables, however, such bound 
forms do hold places on the tone tier (hence the slot-holding underline below) and aie entitled to 
receive the default tone. 

In thfc sample derivations of (17) below, contrast the underlying tonal representations of the 
two phonological words pl-ki-re 'will die' and pi-pd~re 'died', consisting of the verb root 
pi 'to die', the enclitics =-ps>- and =-A-f- and the (optionally) toneless auxiliary re (< WT red 
'copula' ):2'' 



^^Thu. there shouid be distinct allotonic rules for Lhasa and Gai/.e Tibetan to the efl-ect that for instance the 
underlying low tone tends to be realized in short open syllables with a slight rising pitch in the former d.alect but 
with a slight falling pitcb. in the latter. 

It least one other Tibdo-Burman lancuage, Meiihei. the high tone is also analyzed as the default tone: see 
Chelli* iSi ¥1^993: 157 aftSbutes the default high register on the second syllable in T-betan compounds rather 
to the deletion of the laryngeal node (and also the subordinate feature [murmur]) on that syllable. 
■^^Thcre are as yet no experimental .studies devoted specifically to the surface pitch shapes of t«"f2^7JI-eSm; 
Wang 1985: 89 observes that such syllables are usually spoken ai un indistinct mul pitch, but .^^en the p eced ng 
domain-final contour is falling, the fM is normally spread to the toneless syllable. For example, the surface pitch 
contour of the word tpre-la 'then; afterwards" KWTr/«-/a)is t<?el^»-la«< . 

Although this copula is normally weakened to a toncless clitic, it can also be pronounced as a low toned fuU 
syllable rf? in deliberate speech (Wang 1985: 86-8). 
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. TA _ DTR 

Segmental tier: pi-ki-re — > pi-ki-re > ci-ki-re 

Tone tier H_0 H_0 HH 



TA DTR (non-applicable) 

tegmental tier: pi-pa-re -— > pi-ps-re - -> 

Tone tier: H 0 0 H 



pi-pa-re 
I 

H 



We turn now to the input conditions of Low Tone Assimilation above, which require that the 
second syllable be long, consisting either of ,in open syllable with a geminate vowel or diphthong or 
a syllable closed by a sonorant coda. Experimental studies on Lhasa Tibetan have shown that the 
duration of such syllables is roughly double that of short syllables (Hu et al. 1982) and in the case 
of closed unchecked (i.e. sonorant-coda) syllables, the length of the coda equals that of the 
preceding nuclear vowel (Tan and Jiang 1991). Evidently then, long syllables in Tibetan are 
bimoraic where the second vowel in the case of long open syllables or the sonorant coda in the case 
of cosed syllables occupies a separate mora.^' Also, the LTA rule in Lhasa and elsewhere^^ must 
be made sensitive to moraic structure, otherwise its restriction to long (bimoraic) syllables would be 
unexplained. This furnishes direct evidence that the tone-bearing unit in Tibetan is the mora rather 
than the rhyme or the syllable.^^ Hence, LTA can be reformulated as (18): 

(18) LTA (revised): If the tone of the initial syllable is L and the second syllable is bimoraic 
spread L to the latter, causing the originally associated tone on the first mora to delink. 

_Consider now the revised derivation of the same compound me"po:24 (in our transcription 
tte~poo) 'coal-pan': ^ 



(19) TA LTACrevised) DTR 

^^'Po*^ > i^e-poo .-> me-poo > me"-poo24 

(I I '\ ' A 

H L H 



L H 



H 



Another case of low tone spreading is presented by Zedang Tibetan where the medial 
syllable in tnsyllabic compounds, rather than bearing the default high tone as in the other dialects 
becomes low-toned if the tone of the initial syllable is low. As argued above, this particular type of 
low-tone .spreading obviously involves the same underlying pmress as LTA and therefore should 
not be treated by distinct mapping mechanisms. Instead, we propose for Zedang Tibetan a dialect- 
specific tone rule which simply spreads an initial low tone to the word-medial syllable. In contrast 



Checked syllables in Lhasa Tibetan, including those with a nasal coda followed by the glottal stop -9, are shorter 
m duration even than short open syllables (Tan and Jiang 1991). Such syllables are clearly monomoraic 
Ht?'.^,[ fn^^uftl^lu '-1^? ^i'!!i.'^.'o"';tc'ns spreading rules include varieties of dBus Tibetan spoken near the Lhasa 

f e-in) '^"-^ ^p«^- 

ibJI^irSStr j^iSly Si '''''' °" "y-" ("^^3: 77) claims 
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with LTA where the spreading L affects only a neighboring mora, this rule is more thoroughgoing in 
causing the entire affected syllable to be assimilated to L. The tone rules for Zedang Tibetan (which 
also has LTA) are as follows: 



(20) a. Tone Association (TA): = (16a) 

b. Low Tone Assimilation (LTA, revised): = (18) 

c. Default Tone Replacement (DTR): = ( 1 6c) 

d. Trisyllabic Low-Tone Assimilation (TLTA): In a trisyllabic phonological word, if 
the tone of the initial syllable is L, spread L to the medial syllable and delink the 
originally associated tone. 

Consider the derivation in (21) for the compound Ij-tpe-pa 'cadre'in Zedang:""' 

(21) TA DTR TLTA 
le-tce-pa— -> Is-tpe-pa > le-tpe-pa > le"-t^e"-pa53 

\ v r 1 I U-" i I 

LL_ LL_ LHH L SH 

Thus, although v^e do not ascribe the redundant non-initial high tone to a H spread from the 
first syllable, tone-spreading rules do have a role to play in our analysis; namely, they are reserved 
for cases of genuine processes of tonal assimilation such as LTA and TLTA. 

3. Related Issues. The foregoing analysis of Tibetan tone hinges on, among other things, the 
existence of the default high tone and tonal processes that derive word tones from syllable tones. In 
what follows, additional data will be brought in to further motivate these descriptive devices. 

3.L Why is the Default Register High in Tibetan? It will be recalled from §2.2 that the 
postulation of the high register as the unmarked or default register value in Tibetan stems strictly 
from marking phenomena observed in synchronic Tibetan phonology. In order to understand this 
particular skewed distribution of the high vs. the low register, it is necessary to vemure beyond 
synchrony and consider the paths along which contrastive tone arose in Tibetan. Comparative 
evidence presented in §1.1 suggests that Tibetan originally must have been in a state where, the 
effects of stress and intonation aside, all stressed syllables were normally in the high register. The 
existence of this atonal par excellence proto-stage on the Tibetan tonality continuum seems 
beyond doubt, for this is what we actually find in many Amdo dialects, as indicated above. The first 
significant change altering this incipient state was the emergence of the non-contrastive low 
register conditioned by voiced initials in such dialects as Labrang. It should be emphasized that in 
this dialect the phoneticafly conditioned 'register split' is limited only to the initial syllab'e, whereas 
all non-initial syllables are still high-registered, much as in the tonal dialects (Hua 1980: 72, 
Hu 1980: fn. 20). The next diachronic step is taken when, as a result of phonological attrition of 
syllable initials, the low register came to be minimally distinguished from the high register in certain 
environments, as in the case of Amdo Sherpa mentioned above (Nagano 1980). At this juncture, the 
interactions of the two pitch registers in many Khams dialects seem highly suggestive. In Derge 
and Batang CBa'-thang), for instance, syllables which bore voiced obstruent initials in Old Tibetan 
became low-registered if the original initial underwent devoicing, but stay high-registered where 



■''^Note that the underlying tone of the bound agentivt morpheme -pa 'the one who..." is unspecified in our analysis. 
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devoicing has not happened, as shown in these Derge examples: kg 'hear' < WT go; gu 'nine' < 
WT dgu (Qu 1988:323).^' Furthermore, register on syllables with synchronically voiced 
obstruent initials show variation in register which is apparently random in some dialects (e.g. Derge; 
Huang 1993:3; Chamdo; Liu 1984) or apparently conditioned by the articulatory positions of the 
root initials in others (e.g. Batang: jjd^cc 'rice' < WT 'bras; Jid^ce 'resent' < WT 'gras.; 
Gesang 1985: 24). The Khams data above would be hard to explain if the phonetically conditioned 
low pitch is assumed to be always present on voiced-initial syllables (e.g. Hu 1980:31). If, instead, 
we assume a uniform high-register starting point for Tibetan, then the scenario of tonogenesis in this 
language can be conceived of as the emergence of the distinctive low register which has 
encroached gradually on the territory of the high register. ^^d, moreover, the reason that 
the unmarked register in modem Tibetan is high is simply that this represents an original state of the 
ancestral language ubiquitously retained in the modem dialects. Thus, granting the non-initial high 
register default status is not only well-motivated synchronically, but also congruent with an 
important generalization in Tibetan phonological diachrony. 

3.2. Is There 'Tone Sandhi' in Tibetan? As indicated above, the proper domain for tone in 
Tibetan is the (phonological) word where, regardless of the number of constituent syllables, 
contrastive pitch register is borne by the word-initial syllable. Since most morphemes in Tibetan are 
tone-bearing monosyllables (hence the apt term 'morphosyllables' proposed by Light 1978), the 
question arises as to how, given a multisyllabic word, the word tone should be related to the 
underlying tones of the constituent morphosyllables. Sprigg 1975:179 argues explicitly against 
deriving the latter from the former: 

I . . . find it structurally misleading to describe the lexical items sgam 'box', ja 'tea', and 
yi(g) 'letter' that occur in the first-syllable place of the words sgam-chung . ja-ldnng, and 
ge with contrastive low pitch as having changed tone fiom low tone to high tone in the 
words Icags-sgam , gsol-ja, and lam-yig simply because, in these last three words, those 
lexical items have a high pUch, the non-contrastive high pitch appropriate to the second- 
syllable place in those words . . . 

Ossorio 1982: 57; 1 14, sharing Sprigp's conviction, denies in even stronger terms the existence of 
'lone changes' in Tibetan not only diachronically, but also synchronically: 

There is no evidence that the restiicted tonal pattems of multisyllabic words ever developed 
through sandhi changes of the tones carried by the monosyllabes involved . . . Using the 
word as the domain of tone there are no tones to be raised on non-initial syllables; never, at 
any stage in the derivation, do such syllables carry low tone. 

These claims, however, must be reconsidered in view of the counter-evidence to be presented 
in the following sections. 



synchronic state is the exact inverse of the situation portrayed in the slogan /t it5 >f 18. 'Hleh 
pitch if voiceless; low pitch If voiced' seen above. ^ ^ e n; .i.^n 

-^^The effects of the low register encroaching on non-initial syllables can be witnessed in the various rules of Low 
Tone Assimilation discussed in this paper (see 25b, 25d, in §2.2 and 27 in §3.2 below). 
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3.2.1. Low Tone Assimilation in Purang. In §2.2 we inspected in detail a case of tonal 
assimilation, LTA, attested in many Tibetan dialects. An interesting variant of LTA is reported in 
the Purang dialect where the initial low tone spreads to the right when the second syllable is 
originally low- toned, but unlike in the ordinary LTA, tone-spreading does not occur if the second 
syllable is originally high-toned (Tan 1984:633-5). The Purang version of Low Tone Assimilation 
(LTA') can be formulated as (22) (tone rules in Purang are otherwise similar to those in Lhasa): 

(22) LTA': If the tone of the initial syllable is L and if the second syllable is low-toned and 

bimoraic, spread L to the latter and delink the originally associated tone on the first mora. 

Contrast the derivations in (23-4) of the compounds pir)"tglieenii 'serf (phonemicized herein 
as pir)-tglien < WT zhing 'field' + bran 'slave') and mii%aar)55 'populace' (phonemicized 
herein as mi -mar) < WT mi 'human' + dmangs 'multitude, vulgar'): 

(23) TA LTA' DTR 
{}ir)-tglien > {}ir)-tglien > pir)-tglien > pir)iitglieni* 

/\ i" A /\ /\ 



L L L L H 



(24) TA (LTA': non-applicable) DTR 

mi -mar) —-> mi -mar) > >miii-mar)5S 



v 



V 



L H L H 



Thus LTA', ? tonal process which shapes the tonal melodies of multisyllabic words, can 
clearly access the underlying tones of the component syllables.-'-' 

3.2.2. Negator aa- in Lhasa.-''^ Another revealing example of the interactions between syllable 
and word tones is furnished by the negator morpheme ma - in Lhasa Tibetan.-'^ One unusual 
property of ma~, which always forms a single phonological word with the verb stem onto which it is 
tagged, is that its tone is always identical to the underlying tone of the verb stem, although owing to 
its non-initial position the latter always ends up bearing the default high tone. Since the inherent 
tone of the bound morpheme ma - cannot be ascertained (barring recourse to spelling 
pronunciation), we suggest that it is lexically unspecified for tone (i.e. with only a place-holde/ — 
on the tone tier), and that its surface tone is automatically received from the verb stem by applying 
ordinary tone association. The underlying tone on the verb stem itself is then reassociated with the 
default high tone under DTR. Sample derivations of the negative forms of the imperfective verb 



^^Incidentally, the Purang data (as well as the data concerning the prefix ma- in Lhasa, see below) cause 

embarrassment to Duanmu's analysis, as all non-initiui underlying tones are deleted from the tone tier at the start of 

the derivation, making it impossible tor other tone rules to refer to them later. 

•^'^The imperfective negator mi - in Purang Tibetan shows the same tonal behavior (Tan 1984: 637). 

35This prefix is used both with the perfective aspect (in contrast with the low-toned imperfective-aspect negator mx- 

) and in prohibitive commands. 
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stems te9 'saw' (< WT Itas) and t§i9 'wrote' (< WT bris), respectively ma-te9 'did not see' 
and ma-t§i9 'did not write', are offered in (25-6) below (Qu 1981a:24): 

(25) _ TA DTR 

ma-te? — ->ma-te'? > ma55-te'?52 

N I J 

_ H 11 H H 

(26) TA DTR 

ma-tgi? — ->ma-tgi? > ma"-tgi'?52 

Since here the original syllable tone of the verb stem is inherited by the ma - morpheme on its left 
rather than completely obliterated by tone neutralization, a purely distributional approach to the 
problem at hand (such as Sprigg's prosodic analysis) will fail to give a principled account of the fact 
that ma - is high-toned in one case but low-toned in the other. 

The preceding data constitutes strong empirical evidence that, in synchronic word-formation 
at least, derivational relations do exist in Tibetan between tones of the constituent syllables and the 
melody of the multisyllabic word as a whole, and that consequently any adequate description of 
Tibetan tonology cannot do without 'tone sandhi' rules. 

4. Conclusions. In sum, we have witnessed in modern Tibetan a continuum of increasing tonality, 
reflecting various stages of the gradual emergence of lexically distinctive pitch. A typical tone 
system in Tibetan differs in two important respects from a typical Sinitic tone system: (i) the basic 
tonal contrast is that of simple pitch registers: high vs. low: (ii) the register contrast is realized only 
on the first syllable of a given phonological word. So restricted is this type of tone system, in fact, 
that tone in multisyllabic Tibetan words may be viewed as adhering by and large to a simple tone 
template (where 03 = phonological word; a = syllable; H = default high register): 

(27) 0) 

a (a) a 
I I I 
H (H) H 

L 

As indicated above (§1.1), some tonal dialects seem to be undergoing change in the direction of 
adding an intersecting dimension of contour contrast (high level vs. high falling) on the final 
syllable. One straightforward way to express this target of sound chat^ge in autosegmental 
phonology is to add a L to the right of the basic contrast of H vs. L, yielding four distinctive 
monosyllabic tone patterns: H, HL, L, and LL. The realizations of these four underlying tones in the 
Shigatse dialect is given in Table 6 below (Qu 1981a: 189):-^^ 



The level/fall contrast is restricted to sliort rather than long syllables in all three dBus-jitsang dialects discussed in 
Qu 1981a. This is contrary to what is found in one variety of L hasa where the rhyme gets cornpensatorily lengthened 
with the apocopy of the glottal stop, resulting in the contour contrast being cwricd by long syllables (Kjellin 1977). 
Note that 'his means the contour distinction may not necessarily depend on bimoraic syllable structure, contra Yip 
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H 


HL 


L 


LL 


Syllable Type 


Realization 


53 


51 


12 


131 


short 


55 




113 




long 



Table 6. Shigatse Underlying and Surface Tones 



To account for the fact that the contour contrast is limited to the last syllable in a multisyllabic word, 
we need only posit a simple rule which realizes (i.e. associates to the segmental tier) the second half . 
of composite tones only at word-final position. The innovative tone template may then be 
represented as (28): 

(28) CO 

H (H) H 
L |HL 

It is noteworthy that in (28) the secondary contour distinction is superimpo.sed on the basic tone 
template of (27), such that the final syllable, like all non-initial syllables, still bears the default high 
register. 

In Sino-Tibetan, tone systems that are closest to the Tibetan one are those found in such 
dialects/languages as Dongkou (Xiang; Yue-Hashimoto 1987: §2.1) and New Shanghai (Northern 
Wu; Duanmu 1992) in Sinitic, and PaTani (West Himalayish, Saxena 1991), Tamang (Tamang- 
Gurung-Thakali-Manang; Mazaudon 1977: 54-7; Weidert 1987: §7.1.4; Sprigg 1990), and Konyak 
(Northern Naga; Weidert 1987: 215-6; 414-5) in Tibeto-Burman. All of these systems are 
characterized by initial-dominance (Yue-Hashimoto 1987: loc. cit.; Duanmu 1992:68), whereby in 
a multisyllabic domain the pitch pattern of the entire domain is borne solely by the initial syllable 
with the sweeping reduction of tonal contrast on non-initial syllables. The tone systems of 
Dongkou Chinese and Tibetan differ from the other systems wich respect to the behavior of tone on 
non-initial syllables. In the former systems, non-inii:ial tones are largely independent of the initial 
tone, abiding by a more or less constant tonal template such as the ones shown in (27-28) for 
Tibetan. In the latter systems, however, the tonal melody of the initial syllable is mapped onto the 
entire multisyllabic tonal domain. Two distinct types of Sino-Tibetan word-tone systems, therefore, 
can be distinguished: template word-tone systems, represented by Tibetan (Tibeto-Burman) and 
Dongkou (Sinitic), vs. (adopting the terminology in Weidert 1979:84, fn. 28) contoureme word- 
tone systems, represented by Tamang (Tibeto-Burman) and New Shanghai (Sinitic). Needless to 
say, contoureme systems should represent the more fully tonal type on the tonality scale since the 
contrastive melodies in such systems are distributed (spread) to the individual component syllables 
in the domain, whereas in template systems non-initial syllables contribute nothing to the realization 
of the contrastive word-tones. 

In conclusion, if our phonological descriptions are to properly reflect this important 
distinction in the typology of Sino-Tibetan word-tone systems, then we should reserve melody- 
mapping analyses for languages like New Shanghai, and adopt instead default-tone analyses, such 
as the one proposed in this paper, for languages like Tibetan. 
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1. Introduction 

This paper studies one specific discourse pattern where zero 
anaphora in Mandarin Chinese often occurs: the switch-reference 
pattern. By comparing the use of zero with overt linguistic markings 
in the discourse patterns of switch reference, the study presents 
the argument that zero anaphora can be used in the switch reference 
pattern because there may be different cognitive strategies in 
reference tracking by speakers of different languages. 

This study follows the belief that language is used for the 
purpose of communication (Fox, 1987; Fox & Thompson, 1990). The 
study tries to show that language, hence grammar, is contextual, and 
linguistic information is indexical. It is due to tho indexical 
nature of language that information about a referent which is coded 
by a zero anaphor can * emerge' into the Chinese speaker's cognitive 
understanding in their process of Chinese discourse. 

Cross-linguistically, the pattern of switch reference can be 
summarized into three different types: 

a. the canonical type, 

b. the non-canonical type, and 

c. the inference type. 

Of the three types, the third one occurs in Mandarin Chinese 
(hence Chinese) and is the center of the discussion. 

Chinese is a zero-anaphora language, a language that permits 
abundant use of zero anaphora in its written and oral discourse. 
Former studies attribute the choice of zero anaphora as opposed to 
overt anaphoric devices to the discourse notion of topic continuity 
or to the topic chain construction (Chen, 1986; Givon, 1983a; Li & 
Thompson, 1981; Pu, 1989; Tsao, 1979). This study demonstrates that 
in Mandarin Chinese, the occurrence of zero anaphora is not 
constrained to this pattern only. Instead, in the multiple-referents 
discourse environment where there are two referents interacting, 
zero anaphors may code either or both referents; thus the referent 
of the zero may switch between discourse topics. In addition, the 
study shows that the switch of discourse topics coded by zero 
anaphors results in a * switch-reference' pattern that is not 
manifested by overt linguistic markings; reference-tracking in this 
inference type ^switch-reference' pattern can be explained by a set 
of cognitive strategies termed * emergent referents'. 

The plan of this study is as follows. In Section 2, the paper 
examines the three types of switch reference patterns by comparing 
how languages facilitate reference tracking in discourse at the 
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levels of morphology, syntax, semantics and/or pragmatics; in 
Section 3 , the study tackles the question of how reference tracking 
is possibly carried out by speakers of Chinese (and by speakers of 
other zero anaphora languages as well). In Section 4, the study 
concludes by showing the result of an experiment testing the 
hypothesis of ^emergent referents*. 



2. Reference-traclcing in discourse and the switch-reference pattern 

2.1. Subject/ topic and topic continuity 

I first discuss tvo definitions that are closely related to the 
pattern of switch reference: ^grainmatical subject' and ^discourse 
topic' . 

The role of subject is important to this study for two reasons: 
it is one of the grammatical slots where zero anaphora occurs in 
Chinese; and it is closely related to the canonical type of switch 
reference pattern, where in most cases subjects are marked for their 
referential ity with some other subject. 

I do not intend to go into detailed analysis on this issue, but 
I do want to roughly define what this study takes as the subject. A 
subject is the phrase (or clause) that has a grammatical 
relationship with the predicate verb in the following fashions: it 
is one of the main arguments or the only argument of a predicate 
verb (Chao, 1968; Ding et al., 1979) the S of an intransitive verb 
or the A of a transitive verb. 

A topic in this study refers to an NP referent that is the 
center of a discussion in discourse; thus it may be the focus of the 
discourse (Grosz, 1977, 1980). Since this type of topic can only 
occur in a discourse setting, it is referred to as the discourse 
topic in this study^. Topicality in discourse is determined by how 
much the noun referent is in the speaker/hearer ' s conscious mind 
while processing discourse information (Givon, 1983a) . 

The occurrence of zero anaphora is closely related to the* issue 
of discourse topic in Chinese. It is around this role of topic that 
zero anaphors demonstrate a switch reference pattern in Chinese. 

This study is mainly about the occurrence of zero anaphora as 
opposed to other overt anaphoric devices in the switch reference 
patterns. According to Givon, the choice of anaphoric devices 
follows a scale which reflects the discourse pattern of topic 
continuity, as illustrated below. 

la. Givon (1983a: 18) 

most continuous/accessible topic 
Zero anaphora 

pronouns or grammatical agreement 
Full NP's 

more discontinuous/inaccessible topics 



In addition to the discourse topic, there is also a topic in 
the topic-comment construction, which occurs very often in Chinese. 
For a detailed discussion of this construction, see Chao (1968) , Li 
and Thompson (1981) and Tao (1993). 
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b. Iconicity principle (1983b: 67) 

The more continuous/predictable is the topic/ subject/referent 
NP, the less overt expression it needs to receive. 

Both la and lb reflect a cognitive tendency cross- 
linguistically for the choice of anaphoric devices in discourse. The 
principle in lb offers a cognitive explanation for the scale of 
topic continuity as illustrated in la. Both la and lb indicate that 
when a referent is mentioned continuously, and when there is no 
other NP referent that may be mistaken as the same referent, then 
the information of this referent is easy to retrieve in people's 
short-term memories, thus the less overt linguiiitic coding is 
needed. According to Givon, topic continuity is the discourse basis 
underlining the phenomenon of switch reference in all languages 
(Givon, 1983b) . 

The iconicity principle argues for the choice of anaphoric 
devices based on the pragmatic needs of discourse. The principle 
predicts that when there is a switch of discourse topic/grammatical 
subject, information of the new referent may be discontinuous or 
less predictable (than when the topic/subject is continuous) ; hence 
more linguistic coding device (e.g., a full NP as opposed to a 
pronoun or zero anaphor) is used to facilitate reference tracking. 

This study contends that on the one hand, Givon' s iconicity 
principle can be illustrated in languages with different switch 
reference patterns; yet on the other hand, this principle has 
neglected the strength of inference in discourse so that it does not 
offer adequate explanations of certain discourse patterns in 
Chinese. I discuss the second point after we examine the switch 
reference patterns. 

This study only examines zero anaphors whose referents are 
third persons or objects. The study covers only the zero anaphors 
that are discourse related. Namely the understanding of the zero has 
to depend on the local discourse environment. 



2.2. Switch reference 

Reference presentation is a very important aspect in forming 
discourse cohesion, and anaphora functions as one of the major 
reference tracking devices in the discourse of the world's languages 
(Foley & Van Valin, 1984). 

In the introduction section, I proposed three types of switch 
reference patterns cross-linguistically, which are repeated here: 

a. the canonical type, 

b. the non-canonical type, and 

c. the inference type. 

Of the three types, the first two types are similar in that 
both use overt linguistic markings to indicate whether or not a 
certain NP referent is corefere.ntial with some other NP referent in 
the discourse. The difference between the two is that languaaes with 
the canonical type of switch reference pattern have grammatical ized 
this phenomenon with certain morphological markings to indicate 
switch reference. Languages that belong to the second type only 
signal the switch of a referent by some overt anaphoric device. The 
third type refers to a switch reference phenomenon where zero 
anaphora occurs as in Chinese. 
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This type of taxonomy as shown above may not seem conventional 
with regard to the notion of switch reference. But I feel that for 
the study of the communicative function of language, it is well 
justified to compare any linguistic or non-linguistic means that 
serve similar functions of facilitating reference tracking. 

Let's first look at the three different types of switch 
reference patterns. 



2.2.1. The canonical type of svitcb reference 

Formally, languages belonging to this switch reference system 
all have some grammatical i zed patterns marking noun referents in 
discourse. The canonical marking of switch reference "is an 
inflectional category of the verb, which indicates whether or not 
Its subject is identical with the subject of some other verb" 
(Haiman and Munro, I983:ix). 

"Functionally, switch reference is a device for referential 
tracking" (Haiman and Munro, 1983 :ix). 

Psychologically, a grammaticalized pattern marking referential 
Identities and the possible range of nominal referents can 
facilitate reference tracking in discourse processing. 

With regard to Givon's iconicity principle, the canonical 
switch reference languages have all developed special grammatical 
devices to code the pragmatic condition as stated in the principle. 

Languages with canonical switch reference patterns have a wide 
distribution, they include Manchu-Tungus languages (Nichols, 1979), 
the Papuan languages, some of the Austronesian languages (e.g 
L-enakel, Lynch, 1983), most of the North American languages (e.g. 
Central Pomo, (Mithun, 1993)), and some African languages (Comrie] 
1983) . Most of these languages demonstrate a relatively strict word 
order of ov (a few have a VO word order, e.g., Lenakel) , and almost 
ail the morphological markings of SS (same referent) and DS 
(different referent) are on the verbs. The clause whose predicate 
carries the marking may be called the marking clause. The markings 
indicate whether or not the subject is the same or different from 
the subject of the preceding or following clause, which may be the 
reference clause. The marking clauses may be subordinate to or 
coordinate with the reference clauses. In addition to indicating 
referential status, the markings on the verbs may code other 
grammatical functions such as aspect or valence, as can be seen 
below. 

An instance of canonical switch reference can be illustrated in 
Central Pomo, a native American language spoken in Northern 
California. In this language, switch reference is manifested by 
three pairs of aspectual markers. Let us look at the pair -ba (SS) 
?SQT-"nV; X ^^^^ ' ^^^^^ used in realis constructions (Mithun, 

2a. Subject==Subject (SS) 

?a_ chaw=y6-ba mat.i ?-cha-ch-ba ma?a 
I.Agt in=go-same down by.gravity-sit-INCH-same food 

qa. -yu?chi-w 
biting-begin-p 

U came into the house, (I) sat down, and (I) started to eat.' 
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b. Subject=/=Subject (DS) 

?a chaw=y6-w=li hayu=?el ?uda-w t.o. 
I.Agt in=go-p=DIFF dog=the really 

se-ch-inad=a 

glad . to . see-AFF=IMM 

came into the house, and my dog was really glad to see me. • 

We can see from these two examples that the referential status 
of the subjects are clearly marked by the morphological structures 
of Central Pomo^. Notice that in 2b, even though the information of 
the second referent is shown clearly by a full NP (my dog) , the DS 
marker is still used, a fact indicating the grammatical ized switch 
reference pattern in this language. 

In the Manchu-Tungus language family, switch reference is 
carried out by a distinctive set of suffixes which occur 
consistently with SS or DS (Nichols, 1979). The marking clause is 
a nonf inite clause which has a subordinate or coordinate sense with 
the referent/finite clause. One of the systems used to mark SS or DS 
is that SS in the marking clause is indicated by a verbal stem with 
a suffix indicating number agreement with its own subject (plus 
tense markers) , and DS is manifested by a participial stem with a 
tense marker, an oblique case (usually dative) , and a possessive 
marker of agreement with its own subject. Following is an example 
from the Manchu-Tungus language family (Nichols, 1979:421). 

3a. (Ulcha, the Amur branch of the Manchu-Tungus languages) (SS) 

langi nene-meri, icheheti ... 

close go-pl.sim (they) saw 

^Coming up close, they saw (p. 421) 

b. (DS) 

ti duse kalchin -du -ni ni -de, ingda-da aurasi bichini 
dem tiger approach-dat-3sg man-ptc dog-ptc not-sleep aux 

(past ppl) (Pos) 
When the tiger was in the vincinity, neither people nor dogs 
slept. (p. 421) 

In these examples, the verbs in the dependent clauses reflect 
two different forms. In 3a, the verb of the marking clause is the 
verbal stem plus the number agreement with its own subject, which is 
the same as the subject in the referent clause. 

In 3b, however, the verb in the marking clause is a past 
participle with a dative case marker and a third person possessive 
marker, indicating the subject is different from that in the 
referent clause. In the Manchu-Tungus languages, it is almost always 



2 These markers indicate switch reference consistently in 
elicited sentences. But in natural discourse, occasionally they are 
inconsistent. The inconsistency reflects the fact that the primary 
function of these markers are the aspectual relationship between 
two clauses; switch reference is its secondary function. For a 
detailed discussion, see Mithun (1993) and Watkins (1993). 
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different verb forms and their different affixes in the marking 
clauses that signal switch reference. 

Lenakel, an Austronesian language, is different from most other 
languages having the canonical sv/itch reference pattern. This 
language is not verb final, and the; switch reference markings are 
prefixes to the verbs. Following are two examples (Lynch,, 1983:211). 

4a. i -im -vin (kani) m-im -apul' (SS) 
lexc-past-go (and) ES-past-sleep 
I went and slept. 

i -im -vin (kahi) r -im -apul DS) 
lexc-past-go (and) 3sg-past-sleep 
I went and he slept. 

What is unique in Lenakel is that the switch reference mairkers 
are prefixes. The verb of the firsit clause takes pei;son and number 
prefixes which agree with the subject of this clause; the verb of 
the following clauses take prefixes marking whether or not the 
subject of the following clause is coreferential with (or echoes) 
that of the first clause. 

What languages with the canonical switch reference pattern have 
in common is grammatical ized markers on the verbs which indicate 
referential status. Functionally, all of these grammaticalized 
patterns seem to share the property of facilitating reference 
tracking. 



2.2.2. The non-canonical type of switch reference 

These languages differ from the canonical type in that they 
lack specific marking systems to mark switch reference on the verbs. 
Instead, these languages only use the anaphoric devices (e.g., the 
stressed independent pronouns (Spanish, as discussed by Bentivoglio 
(1983)), the logophoric pronouns (e.g., Igbo, as discussed by Comrie 
(1983), etc.) to indicate referential status. 

As has been mentioned in the previous section, psychologically, 
a grammaticalized pattern marking referential identities can 
facilitate reference tracking in discourse processing. Even though 
the non-canonical type of switch reference does not resemble the 
canonical switch reference pattern, functionally they both serve the 
goal of reference tracking. Givon argues that these languages follow 
the iconicity principle to code subject referent continuity, as 
stated in lb (e.g., English, (Givon, 1983b)). 

Let's first examine the followin9 English expression (Givon, 
1983b: 59) . 

5. He gave presents to the King and the queen. He thanked him, but 
she just grunted. 

Since English has a M/F gender system in its pronouns, the use of 
stressed pronouns here is sufficient to code the switch of subjects. 



The abbreviations in the morpheme to morpheme analysis 
are used for: exc: exclusive; ES: echo-subject. 
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in some West African languages, there is a set of logophoric 
pronouns that indicates coreferential status. Examine the next 
example from Igbo (Comrie, 1983) . 

6a. 6 siri na 6 byara. (DS) 
he said that he came 
^Hel said that he2 came.* 

b. 6 siri na ya byara. (SS) 
he said that LOG came 
^Hel said that hel came.' 

These logophoric pronouns can code different referential status 
among single third person referents to facilitate reference 

tracking. . • •+. v. 

Stressed independent pronouns may also indicate swircn 
reference. In comparing the choice of bound personal pronouns and 
stressed independent pronouns in Spanish, Bentivoglio (1983) argues 
that certain occurrences of stressed independent pronouns function 
merely to signal the switch of reference. -, *. m 

From the functional and pragmatic perspective, almost ail 
languages reflect some forms of the non-canonical type of switch 
reference pattern to facilitate reference tracking m discourse 
processing. For instance, to deal with the English expression as 
illustrated in example 5, Chinese may have to use full NPs since the 
language lacks a gender system to distinguish the two referents in 
the spoken language. Yet when there is no need for such a special 
contrast, Chinese may use zero anaphors to code two different 
referents in a switch reference pattern. This is the pattern that I 
am calling the inference type. 

2.2.3. The inference type . 

The inference type of switch reference pattern m Chinese 
actually is the discourse pattern where the switch of subject/topic 
is not marked by any linguistic devices at all. In this case, zero 
anaphors are used to encode two different referents. Reference 
tracking is thus left for the reader/ speaker to infer by means of 
what is available in the discourse. 

The data for this study are from both written ana 
conversational discourse. The written discourse data are from four 
books r Hona Lou Menq by Cao Xueqin & Gao E; Dongtinq Hu S^enhua by 
Kang Zhuo; Zheng Honaai Xia by Lao She; and Zhonqquo Shenhua . 
Chuanshuo by Yuan Ke) . v. « 

The conversational data are from approximately three hours ot 
natural conversations recorded in four different places, Beijing, 
Changsha, Australia and the U.S. All speakers in the conversations 
are native speakers of Mandarin Chinese. The switch reference 
pattern with zero anaphors has been observed in all four books as 
well as in the conversations, a fact indicating that this phenomenon 
has existed in Chinese from the language of vernacular (e.g., the 
Chinese language as is used in Hnna Lou Menq) to contemporary 

usages. . 

A zero anaphor in the switch reference pattern can represent 
either a non-subject in the previous clause (the switch role 
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pattern), or it may *code' some referent mentioned in the prior 
discourse. All the referents coded by zero anaphors are considered 
discourse topics that are at the center of the discussion. 

In the following examples, each anaphoric device is assigned a 
number to indicate which referent it refers to. The location of the 
zero anaphor is somewhat arbitrary, based solely on my subjective 
understanding of the discourse data. 

Before we look at the switch patterns, I would like to show a 
Chinese discourse pattern where zero represents the grammatical 
subject of the previous clause. 

7. (Kang, 1991:163) 

yi -ci ta-menl jiao-hui -le zhe ge ySng-le ji -nian 
that one-time 3 -pi hand-return-PFV this CI feed-Asp several -year 

-> de r6u -hu62, 01 yAo -le Zhong-jia liCi-bSi xiandayang. 
GEN meat-goods take-Asp Zhong-family six-hundred silver-dollar 

That time they returned the * meat-goods' whom they had fed for 
several years, (and theyl) took six hundred silver dollars from 
the Zhong family*. 

This example illustrates a SS (same subject) pattern in that 
the zero represents the grammatical subject f tamen . *they') of the 
previous clause. The subject ^they' refers to a group of bandits. 
Since the two subjects occur in two adjacent clauses, this example 
forms a topic chain discourse pattern. The occurrence of zero 
anaphora is often attributed to this pattern. (Tsao, 1979. Li & 
Thompson, 1981) , v , , xax « 

But the next two examples show that zero anaphora may also 
occur in the discourse pattern with DS (different subject) ; thus 
topic chain xs not the only discourse environment permitting the use 
of zero anaphora. The first two examples illustrate a ^switch role' 
pattern where the zero represents a non-stibject in the preceding 

8a. (Lao, 1981:78) 

-> Fuqinl jin -niAn zhi mSi-le yi ke Wtisem6i2 . k6shi 02 
father this-year only buy-Asp one CL verbena but 

kai -hua po mii liqi. 
bloom-flower rather sell strength 

This year father 1 only bought one pot of verbena2, but (it2) 
worked rather hard to produce many flowers. 

b. (Cao, 1982:892) 

BSoyCi xian yin -le ban bei, 01 ch6u ren bu jjan, 
baoyu first drink-Asp half cup see people Neg see 



The ^meat-goods' in the story is the name given to a group 
of people raised by those bandits. They were brain-washed so th.-t 
they could be turned to the officials to take the blames for the 
banaits. 
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-> 01 di -yCi Fangguan2, 02 duan-qi-lai bian yi yang buo. 
pass-give Fangguan hold-up-come then one raise neck 

Baoyul drank half of his wine. Seeing that nobody was noticing, 
(hel) gave the rest to Fangguan2 , 
-> (who2) threw her head back and emptied it in a gulp. 

The parentheses in the English translations indicate that the 
elements inside are zeros in the Chinese versions. 

Both examples in 8 share a similar pattern: the zero codes the 
referent that is the object of the previous clause. Hence the 
discourse pattern is DS, but the referents of the zeros are still 
mentioned in the previous clauses. 

Next let us examine examples where the referent of the zero is 
not mentioned in the immediately preceding clause. We first look at 
an example from a natural conversation. 

9 . ( Changsha , p 1 ) 

->1. A: Ta2 (mao) jou tiao-dao di -shang-lai, 
it then jump-arrive ground-on -come 

->2. 01 dao -di gSi ta2 zhua -zhu le. 

till -end by it catch-stop PFV 



3. B: 



Shi ma? 
right Q 



4. A: ^ Nei ezil fei-lai cuan-qu de, 
that moth fly-come dash-go CSC 



•>5, 



01 yixia jiu dao zhei-bian lai -le. 

all-of-a-sudden then arrive this-side come-PFV 



->6. 



Yixia, 

all-of-a-sudden 



02 you bS tal zhua -zhu -le. 
again BA it catch-stay-PFV 



1. 
2. 
3. 



B: 



... It2 (the cat) then jumped down. 
(The mothl) finally was caught by it2 . 
Is that so! 



4. A: That mothl flew all around. 

5. In a flash, (1) flew over here, 

6. All of a sudden, (2) caught itl again 

In this example, the two referents (the cat and the moth) were 
interacting with the cat trying to catch the moth. We can see that 
the subjects of four clauses (clauses 1,2 and 5,6) are switching 
between the cat and the moth. Except for the first subject at line 
1, which is a pronoun, the remaining three subjects are all zeros. 
Although even the pronoun here does not differentiate between the 
cat and the moth, there is no confusion between the subjects of the 
verbs of various clause in '-his example, in which reference 
presentation is simplified to the extreme. 

Next is a similar example from the written data. 
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10. (Lao, 1981:119) 

1. Dui zhexie xiaoxil, 
to these news 



2. ta2 gaoxing ne, 
he happy Int 



3. 02 jiu xiSng yi xiSng 01, 
then think one think 



-> 4. 02. bu gaoxing ne, 
Neg happy Int 

-> 5. 01 jiu you zu6 Sr jin lai, 
then from left ear enter come 



-> 
-> 



6. 01 y6u 6r chu qu. 
right ear out go 

1. As for this kind of newsl, 

2. if he2 happened to be in a good mood, 

3. then (he2) would give (itl) a thought; 

4. if (he2) was not pleased at the moment, 

5. then (itl) would enter from his left ear 

6. and (01) exit from the right one. 



In this example, there is a person (ta2 , ^he') and an object 
(xiaoxil, ^news') . At lines 4 and 5, the subjects, of the two clauses 
are both coded by zeros, but their referents are different. The 
referent of the zero anaphor in clause 5 is not the same as that in 
the immediately preceding clause, but is mentioned overtly at line 
1. Yet such switch of referents does not seem to cause any problem 
to Chinese readers. 

The last example to be discussed reflects how a switch 
reference pattern with the use of zero is created. 



11. (Yuan, 1984:282) 

1. Taiyangl chu-lai le, ... Dang tal gang cong Yanggd chu-lai, 
sun out-come Asp when he just from Yanggu out-come 

01 zai Xianchi~li xl -le -ge zSo, 01 cong Fusang shu de 
at Xianchi-in wash-PFV-Cl bath from Fusang tree GEN 

xia -mian sheng-shang Fusang shu de dianding de zhe 
under-side rise -on Fusang tree GEN top GEN this 

shih6u2. Mil iiao-zuo "Chenming" . 
time right name-as Chenming 

2. 01 yljing sheng-shang Fusang shu de dianding, 01 zu6~shang 

already rise -on Fusang tree GEN top sit-on 

mama g6i zhUnbei hSo de chezi, 01 kaishi chufa le, 
mother for prepare well GEN chariot begin set-off PFV 
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zhe shih6u2 iiu iiaozuo "FSiming". 
this time then name Feiming 

->3. dao -le Qii'a de difang, 02 iiu iiaozuo "Danming". 
arrive-PFV Qu'a GEN place then name Danming 

1. The sunl came out,... When hel came out of Yanggu, (01) took a 
bath in Xianchi, and (01) ascended from the bottom of the 
Fusang tree onto the very top, this time2 was named Chenming. 

2. After (hei) had already ascended onto the top of the Fusang 
tree, (01) sat into the chariot already prepared for him by his 
mom and (01) started to set off, this time2 was called Feiming. 

->3. When (hel) arrived at the place Qu'a, (the time2) was named 
Danming . 

This example displays two things: the sun's activities in the 
morning, and the name of various time periods associated with the 
sun's ascension. In this example, the first referent, the sun, is 
often coded by a zero, but the second referent, the time, is 
presented in full NPs until the last part at line 3, where both 
referents are coded by zeros, and a switch reference pattern is 
formed. In this example, the repetition of the verb phrase ^ iiaozuo ' 
^to be named as' in association with the sun's activities have 
produced a pattern that makes the referents of the last clauses 
almost transparent. The use of zero in this case seems very natural. 

From examples 8-11, we can see that reference switching does 
not restrain the use of zero anaphora in Chinese discourse. Recall 
that Givon has proposed an iconicity principle predicting the choice 
of anaphoric devices with regard to subject continuity. The 
principle is repeated below. 

12. Iconicity principle (givon, 1983b: 67) 

The more continuous/predictable is the topi ;/ subject/referent 
NP, the less overt expression it needs to receive. 

In other words, the principle predicts that when a certain referent 
is unpredictable, when there are potentially two NPs that can be 
taken as representing the same topic/subject/referent, then more 
overt linguistic coding is needed to distinguish the referents. 
Languages bearing the first two types of switch reference patterns 
(the canonical and non-canonical types) all demonstrate the 
discourse pattern as predicted by this principle. As was mentioned 
at the end of section 2.2.2, sometimes Chinese discourse also 
reflects certain patterns that are predicted by this principle. But 
from examples 8-11, we can see that in other situations Chinese 
discourse can also produce patterns the principle fails to account 
for. 

It seems counter-intuitive to use the least coding device (zero 
anaphora) in the situation of switch reference, but a close 
examination may reveal that if we observe discourse processing from 
a different angle, the inference type of switch reference pattern 
may not seem so mysterious. I discuss this point in Section 3. 
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2 « 3 . Interim summary 

In this section we have examined three types of linguistic 
patterns coding the phenomenon of switch reference. The three types 
of switch reference patterns illustrate two extremes: on the one 
hand, it seems so important to facilitate reference tracking in 
language processing that some languages have developed specific 
grammatical patterns to obligatorily code switch reference; on the 
other hand, languages like Chinese can afford not to use any overt 
linguistic coding for similar discourse patterns. Thus we can see 
that language formation seems to follow a continuum of presenting 
information as clearly as possible (e.g., the canonical switch 
reference patterns) to maintaining the economy in communication 
(Grice, 1975) by using overt linguistic devices only when absolutely 
necessary (e.g., Chinese)^. 

The next question is, how do Chinese speakers tolerate the 
abundant use of zero anaphors? This issue is addressed in the next 
section. 



3. Emergent reference and the construction of grammar 

3.1. Emergent reference 

Chinese is an isolating language with no phonological or 
morphological markings indicating parts of speech, gender, case or 
grammatical relations. This ^ under marked' form once led some 
historical linguists to consider Chinese a primitive language that 
has not developed to the contemporary level (e.g. like English and 
other Indo-European languages) with conjugation, word class 
differentiation and so forth (Schleicher, reprinted in 1983). Even 
though such a view of Chinese was presented over one hundred years 
ago and most linguists, as well as people from other fields, have 
taken a view contrary to Schleicher's claim, the question of how the 
language functions with practically no morphology still remains 
unanswered. 

The use of zero anaphora reflected in the switch reference 
pattern m Chinese discourse suggests that it is very likely that 
looking at overt linguistic patterns alone does not offer sufficient 
explanation for the abundant occurrence of zero anaphora in Chinese 
The solution for the abundant use of zero anaphora can only come 
from the information in the discourse context. Here I suggest that 
reference tracking in Chinese is probably carried out with the help 
of discourse cues (Fox, 1987) , which may be one of or a combination 
of the following: 



Languages with the canonical type of switch reference 
pattern also require inference in information processing (see, 
e.g., Haiman & Munro, 1983; Mithun, 1993; Watkins, 1993). On the 
other hand, Chinese also requires overt linguistic devices when 
there is the need (see Tao, 1993) . This fact indicates that the 
general trend m language formation is, whenever possible, to 
provide information which is as explicit as necessary, but not is 
not excessive or redundant. 
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13. Discourse cues: 

a. prior discourse context; 

b. specific semantic requirements of the verbs associated with 
the referents; 

c. the specific nature of the referents presented by zero 
anaphora ; and 

d. language users' general knowledge about the world, including 
their social, cultural, and personal experiences. 

These discourse cues do not seem to offer a specific 
explanation for reference tracking since it covers almost everything 
that one needs to consider in language processing. Yet if we 
consider the actual language data, we can see that reference 
tracking in Chinese does rely on these cues, though not all the cues 
have to be considered in each individual case. 

In each instance of matching a zero to its proper referent, it 
is likely that Chinese speakers pay attention to one or two of the 
cues that are the most salient. The referent of the zero may thus 
* emerge' together with the discourse cue to complete the information 
needed in discourse processing. Let's first consider the two 
examples in 8. 

In 8a, it seems the semantic cue from the second verb kaihua 
* blossom' is sufficient for people to know that it is the plant that 
is performing this action; hence the referent of the zero in the 
second clause has to be the grammatical object WCfsfemei ^verbena' in 
the previous clause. 

In 8b, semantics of the verbs do not offer any specific cues 
since both people have the ability to drink wine. Here the discourse 
cue has to be the prior discourse information (Baoyu passing his 
wine to Fangguan) that informs us the one who drank the wine is 
Fangguan . 

In 9, people have to rely on the status of the two referents 
(cat the predator, and moth the victim) to make clear which 
referents the zeros represent at lines 1,2, and 5,6. 

In 10, one has to relate to the general knowledge about people 
(who can think) and about information (that can metaphorically enter 
people's ears) to infer that the referent of the zero anaphor in 
line 5 is not the human subject in the preceding clause. 

In example 11, it seems that the association of the verb 
1 iaozuo *to be named as' plus the discourse structure (a type of 
cause-and-ef feet pattern) makes it clear that the referent of the 
zero in the last clause is different from that in the preceding 
clause. 

From the above discussion, it seems obvious that the discourse 
cues are local in that one cannot come up with a summary of 
essential cues that can be useful to all discourse patterns (unless 
of course we call all the cues essential) . Next I would like to 
propose a model explaining how the cues are utilized when Chinese 
speakers process discourse information. 

At the beginning of this paper the study mentioned the belief 
that language is used for the purpose of communication. In the 
interactive usie of language, the choice of anaphoric devices 
reflects the speaker/writer 's cognitive understanding of the 
interactive needs in the communication; and reference understanding 
could reflect the hearer/ reader 's cognitive strategies. I refer to 
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one set of these strategies as ^emergent reference' in this study. 

Emergent reference refers to a set of cognitive strategies used 
by speakers of a language to process discourse information. To 
successfully comprehend any discourse with abundant use of zero 
anaphora, the hearer/reader must pay special attention to discourse 
cues associated with each referent represented by zero anaphora. 
These distributed local cues serve as the basis for the interpretive 
process from which the right referent emerges. The procedure to 
utilize these cues could be summarized below. 

14. Construction and integration 

a- Cue identification : When processing discourse information 

with many referents missing, language users are attuned to the 
specific cues the local discourse context provides, cues that Have 
to do with the referent?; 

t>' Reference construction ; While processing language, individual 
referents are integrated into an information pattern constructed 
with information from these distributed local cues; 

Informa tion integration ; By integrating the cues to the 
recurring zero anaphors, which now serve as the referents in 
question, the referents that are ^missing' due to the use of zero 
anaphora emerge, so that reference-tracking is not only possible but 
also easy. 

This ^construction and integration' model is based on the 
theory that information of a referent is indexical in that it comes 
from the local discourse context (Heritage, 1984; Fox, 1992). It 
also maintains that language is contextual in that, since the 
information about a particular referent that is coded by a zero 
anaphor actually comes out with the help of local discourse cues, 
the information has to be constructed locally. Referents thus 
^emerge' out of the local discourse context into language users' 
cognitive understanding. 

The construction and integration model is probably unique to 
speakers of Chinese and other zero anaphora languages as one of the 
major cognitive strategies used in discourse processing. This point 
is elaborated in Section 4. Next we look at how grammar is at work 
with regard to the inference type of switch reference pattern. 



3.2. The on-line construction of grammar 

The above discussion suggests that the use of zero anaphors in 
the switch reference pattern is conditioned by local discourse 
formation. Here the study also maintains that this switch reference 
pattern also reflects the fact that grammar is contextual. 

The linguistic phenomenon discussed in this study seems to be 
related to pragmatics, which is often set aside from the ^core' 
studies of, e.g., morphology and syntax, when we talk about grammar, 
people often think about the well-formedness of individual sentences 
(at least when grammar refers to syntax) . But at the same time 
people cannot avoid putting sentences into some context to judge 
their acceptability. Thv:s one cannot ignc^« the fact that very often 
the acceptability of an expression has to be contextual. 

When people claim that Chinese is a zero-anaphora (pro-drop in 
some theoretical framework) language, they are looking at the 
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abundant use of zeros in its discourse, not at the hypothetical 
zeros that one finds in many languages (e.g., Tal xigina 01 qu . Hel 
wants 01 to go) ; thus the claim about Chinese is based on its 
discourse patterns. 

Based on the switch reference pattern in Chinese discourse, one 
can see that the occurrence of zeros follows certain local 
contextual cues: whenever a zero is used instead of overt anaphoric 
devices, the local discourse pattern almost always provides certain 
cues that can be associated with it, a phenomenon that happens too 
frequently to be attributed to accidental appearance in discourse. 

From examples 8-11, we can see that it is often the verb that 
serves as the discourse cues. The manipulation of verb forms could 
be one of the means of constructing Chinese discourse patterns with 
regard to the use of zero anaphora. 

Another pattern is the apparently headless relative clause 
whose head is actually encoded by a zero. Following is an example. 

15. (Beijing, p. 71-76) . ^....^ 

1. A: Ni -men juti q&o de shi shenme, biyie she^i? 

you-PL exact make NOM be what graduation design 

2. E: W6 q&o le yi mazui qiangl. 

I make Asp one anesthesia gun 

(A and E continued talking about the anesthesia gun, 

thirteen more turns . ) 

(A, G and B talked about water melon seeds, and about 

A ' s mother , fourteen more turns . ) 

(A, B and E talked about Lin, a person not present at 

the conversation, twelve more turns. There are about three and 
a half pages of transcription from line 7 to this point.) 

->42. A: Ao, na ni Q&o nei -ae 01 jiu shi, dSng -yu 
OK then you make that-CL just be amount-to 

01 da -chu-qu yl -hou zhi -jie jiu, nei -ge 

shoot-out-go then-after direct-connect right that-CL 

yao jiu zhi -jie zhtishe jin -qu la, ha? 

medicine right direct-connect inject enter-go Asp Q 

43. E: Ta nei zhen -tl ba, ... 
it that syringe-body RF 

1. A: For your graduation project, what exactly did you design? 

2. E: I designed an anesthesia gunl. 

->42.A: Oh, that (syringel) that vou designed is just like, (it) 
amounts to the fact that after (itl) is shot out, (it) 
immediately — the anesthesia medicine gets injected right 
away, is that right? 
43.E: Of the internal part of syringes, 
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This example presents another instance of the repetition of the 
verb associated with a zero anaphor. This time the verb is inside a 
relative clause with the head (syringe) omitted. Headless relative 
clauses seem to be one of the structures that is often used to 
present information about referents without mentioning the referents 
in discourse (they occurred in both the written and conversational 
discourse data) . I have not done a systematic study of this 
construction, but the structure appears to be one of the means to 
associate the verb with a zero anaphor to present information about 
its referent. 

Notice that in this example, the speaker tries very hard to 
return to the previous discussion of the graduation design by using 
a headless relative clause and detailed description of the function 
of the syringe, but the overt referent m^zui aiana ^anesthesia gun' 
was never mentioned*. 

This section has tackled the question of grammar. The 
discussion maintains that the construction of Chinese discourse is 
extremely context dependent, as is reflected in the switch reference 
pattern with the use of zero anaphora. Each occurrence of a zero 
could trigger a construction of a local discourse unit whose 
function is to facilitate reference-tracking to make a coherent 
understanding of the language. Thus discourse production and 
comprehension should be taken as on-line mental activities. 



4. Conclusion 

This paper has mainly discussed two issues: how the pattern of 
switch reference is coded cross-linguistically; and how Chinese 
speakers track anaphoric referents in the pattern of switch 
reference with the use of zero anaphora. The paper concludes by 
examining the cognitive impact of zero anaphora on Chinese speakers 

In section 3.1, the paper suggests a set 
of cognitive strategies termed ^emergent 
reference' that could be utilized by native 
Chinese speakers. The paper also suggests that 
the strategies could also be part of the 
primary cognitive strategies of speakers of 
all zero-anaphora languages. The study 
suggests that these cognitive strategies may„ 
not be primary in speakers of languages where 2 
zero anaphora is not one of the major^ 
anaphoric coding device; thus there is a^ 
cognitive difference among speakers ofl 
different languages. | 

A preliminary experiment comparing native^ ' 
Chinese and native English speakers suggests 
that Chinese speakers seems to be sible to 
handle discourse passages with abundant use of 
zero anaphora better than can native English 
speakers . ' . ,. 

. Full MMngNF 

The experiment used a set of English 

^ Passage type 




* This phenomenon is termed ^a return-pop'. For a detailed 
discussion of the return pop, see Fox (1987) and Tao (1990, 1993). 
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passages from a standard test, each passage is accompanied by six 
four-alternative multiple choice questions. Half of the passages had 
some grammatical subjects or objects missing (resembling the zero 
anaphors in Chinese) . The results shown in the figure here indicate 
that even though native Chinese speakers made more errors with the 
full passages, native English speakers made significantly more 
errors in the passages that had Chinese type of zero anaphora. The 
results indicate that it could very likely be due to the different 
cognitive strategies that native Chinese speakers scored better in 
processing discourse with abundant use of zei*o anaphora. 

In sum, the abundant use of zero anaphora as reflected in the 
switch reference pattern in Chinese discourse illustrates the 
contextual nature of the language. The phenomenon calls for our 
attention to the nature in discourse formation. It also demonstrates 
what the study of discourse can tell us about our language and about 
its speakers. 
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(functional refinement) ° tKki);(Ul]^III|!Hke(niJ77:i:Slf(;I^A ' ^ ^mm)) ° ^£ 
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^^^^^ o mmmm • ""n^j j^^^ft^^Mmj^eiJi?)* • jSHi^T^BtAnti^A*!^ 

fi^j ^mfi^^^j fnx^^u ^Mi'pj ° ^MM'^j mm ' miLj 
' ^i^^ij 'mm%^\j mi^^mm 'mm^A ±nm^. ° xmmmm 
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\-i ■mmm'^.^mm'^ ° iii^^it^u ^im^Hj (topic prominent) ' 'mm\ 
B^j (pro-drop) ^ wMk^ ^ n^MBm'mm ° '-3^' m.^^mm^im-m^-m^n.^ 





1958 




1993 




1958 




1958 


M- -^fa; [Sn 


1982 




1980 




1990 




1989 


MEM 


1986 




1989 


Emm 


1985 




1957 




1990 




1990 




1984 








1983 




1983 


MM 


1986 




1988 


MM 


1986 




1982 




1992 




1979 




1990 




1993a 



1993b 



<m^''H]{mm^mmm> • «'4^iiig3:c»i958.3 : 

117-8 ° 

«igW^M^»2_:84-99 " 

«MixmmAm^» • ^T^ep^i^ ° 

1989.2:123 ° 

X» 1986.3:204-216 ° 

<^a^ rjg_, :^^6^^jifn^^> . 1989:1- 
9 o 

<mwmmm.^> • 1:17-33 ° 

</t5i5me5':6^5li^?II> • 1990.3:191-206 ° 
15:303-329 ° 

<^fi^^gitfB§i^> • 1983.3:201-6 " 
10:367-378 ° 

3i» • 114-128 ° 

<PtftMtt5'^I^S?^ilM55-1-:ff> • Journal of Chinese 
Linguistics 16.1:55-71 ° 

<^'n•^m^wm> • 15:277-302 ° 

<eS®ii^«Jr£l^;Ji^> • 8:121-179 ° 

flj» 62.4:717-786 " 

rifj, i^» 2(OT) ° 
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Word Order Flexibility in Chin^ise: 
A Typological Study of Mandarin, Min, and Yue Dialects 

Zhiqun Xing 
University of Washington 



1. INTRODUCTION 

It has been generally agreed among linguists that Chinese has^^^^ (SVO) 
word order (Li and Thompson 1974a, 1974b, 1975a, 1975b, 1981; Tai 1973, 1976, 1978 
Light 1979; Sun and Giv6n 1985; Xing 1993 among others). Its alternative is OV order in which 
the subiect can either precede the verb yielding OSV form or precede the object yielding SOV 
form When the latter form is used, the object is often marked by the morpheme Js, b a, W 
iinf,oMinnQ/t<iuene ^ ka f^ttui or ^ lianAin, depending upon dialects (Zhao 1947, Li ly^U, 
Sfl^f S^i98o!^eng Yuan 1989, Y^g 1991, Hashimoto 1993), as illustrated in 

1-3.1 

1. Mandarin: ^ 1^1 ^ 1 ° 

Isg. obj. M rice all eat asp. 
'I ate all the rice.' 

2. Min: ^-^1^0^^^^^^ " 

3sg. obj.M one egg break-go 
'He broke an egg.' 

3. Yue: f^l^ "^i^ 

you neg-good obj.M this- thing make-break 
'You should not break this thing.' 

Traditionally, researchers working on Chinese dialects focus much of their study on 
matching one type of OV order in Mandarin with that in the other dialects (Li 1959, Gao 1980^ 
Yuan 1989 among others). For instance, a number of studies explore whether the 5>-fta-L'-v 
construction (henceforth, the ba construction) used in Mandarin exists in the Min or Yue dialect 
This kind of study is useful; however, it is not sufficient to reflect the fundamental difference or 
similarity among the dialects and, furthermore, to draw a typological conclusion ot Chinese 
languages. 

The present paper is a comparative study of the factors that determine the use of OV order in 
Mandarin, Min, and Yue dialects. All OV orders in these dialects, with or witiiout subjects, 
have been studied. The results of this study show that the relative frequencies of OV orders in 
these three dialects are more or less the same, however, they differ from one another m terms ot 
when objects need to be marked and which object marker is commonly used. Apart from these 



II wish to thank Lin Li-ching, Yeh Ling-hsueh, and Holly Lam for their help m verifying 
the Min and Yue data. I did not provide phonetic transcription in the examples because ot my 
limited knowledge of the different pronunciations among the three dialects. 
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differences, I argue that the conditions which govern objects to be moved to preverbal position 
can be observed from three aspects: a) the syntactical consideration: the existence of a 
postverbal complement; b) the semantic consideration: the affectedness of objects/patients and 
me meanmg of object markers; and c) the discourse pragmatic consideration: the contrast/listing 
tunction and the functions of promoting patients and/or demoting agents. 

The organization of this paper is as follows: Section 2 provides the results of text counts of 
Uie relative frequency of OV orders among all transitive constructions in each of the three 
dialects. Section 3 discusses various semantic, syntactic, and discourse-pragmatic functions of 
uiirsnTdy ^ ^" dialects. Finally, section 4 gives a conclusion and implications of 

2. RELATIVE FREQUENCY OF OV ORDERS IN THE THREE DIALECTS 

The occurrences of OV constructions among all transitive clauses have been counted in 
naturally occumng texts2 in the three dialects. The results of the counts arc given in Table 1 : 

Mandarin Min Yue 

Category No. of Tokens ^No. of Tokens No. of Tokens 

X?, / u- w ^'^^ (91.02%) 853 (92.02%) 835 (93.40%) 

OVw/obj.M 61(6.37%) 35 (3.77%) 24(2 68%) 

OVw/oobj.M3 25(2.61%) 39(4.21%) 35(3.91%) 

Total 958(100%) 927(100%) 894(100% ) 

Table 1: Relative frequency of OV order among all transitive constructions in the 
three dialects 

The results in Table 1 are instructive in two ways: one is that the overall frequencies of OV 

iXS^'' "^'^ '^^^^"i^o^^j^ '"^^"^ ^" *e ^ dialects are sSnilar: less than 
10% of all transitive constructions (8.98% in Mandarin, 7.89% in the Min dialect, and 6 59% in 
the Yue dialect); the other is that in Mandarin, more than two-thirds (6 37%) of OV 
constructions have a marked object, while in the other two dialects, more than a half of the 
objects m OV constructions are NOT marked (4.21% in the Min dialect and 3.91% in the Yue 
direct). Statistically, the difference between the number of the marked and urnnariced oK"^ 
O V constructions of the three dialects is significant (chi-square test: p<0.0 1) 



Y.v, u -^ ^^rJr^^i^' ^yjfongLouMeng' m¥. 2yChen Rong Zhongpim 
^S-^^^."^-^' ^^*«f;^K?(Ro"gChen, 1983), 3) 'Gongkai de Neican' {NeicJ)mm^ 
^angkang Zhang etal, 1986). Min: 1) XhaoThou Fangyan' mmt (Zhan 1959) 2) 
Spoken Amoy KoWcien' (Bodman 1955-1958), 3) 'Gufhi' in Dong W 1967), 4) 
iZlinTyitr ^r?oWWJ^^.^^^^^- ^^^^^ (Chao 1947), 2) 

this study, OV order without an object marker includes those constructions in which 
there IS no syntactic subject; however, from context, one may figure out the semantic agent of 
^e verb. For instance: mmmmmj o 'Everything for the wedding has SpSd ' 
^^n^t^'^tSirnT^f ''/"^^^'^^ the sentence, theoretically, it is not appropriate to cil the 
consmiction OV order (see d-taiied discussion of this respect in LaPoUa 1991 and Xing 199^) I 
use OV order to refer to the iUcstrat? H sentence purely for the puipose of convenience ' 
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Furthermore, objects in OV constructions of the three dialects are found to be marked 
differently: sometimes by ba, sometimes by tsiang/tsueng ka, or lianAin. Ph^^o^^^^^Qof 
been noted in literature (e.g. Zhang 1972, Gao 980 Teng 1982 Yuan J989 Janf 1991 
Hashimoto 1993); however, we do not know the distnbution of the different object markers in 
the three dialects. The current study, based on the results of text counts of the frequency of OV 
constructions in the three dialects, found that in Mandarin almost all ^^J^^ct markers are 
and a few of them are it Han. In Min and Yue dialects, on the other hand, the conmion object 
marker is Jjf tsiang/tsueng; tb.e Min dialect also uses ^ tui orj^ ka to mark objects while Yue 
sometimefuses 4/m to mark objects, as in Mandarin. These differences lead us to ask 
whTC^y refl* certain functional differences of OV constructions in the three d^^^f^or 
Aey are merely lexical variables. Hence, the following secUons wi 11 discuss the USE of OV 
constructions (both with and without an object marker) in the three dialects and find out whether 
aU O V constructions in the three dialects are governed by the same Imguistic constramts. 

3. CONDITIONS ON THE USE OF OV ORDERS 

Cross-linguistically, word order alternation can be governed by a number of factors. Some 
languages (e g. German, English) use different word orders to meet syntactic requuements (ej 
in Geman: main clause vs. subordinate clause); some languages (e g. English, Hebrew) 
alternate word order due to discourse pragmatic consideration (e.g. old information vs ne w 
information). In this section, I will examine the use of OV constructions in the three dialects 
from three aspects: syntax, semantics, and discourse pragmatics, to see what causes objects to 
be moved to preverbal position and whether we can draw a typological conclusion for the 
function of OV constructions in the three dialects. 

3.1. SYNTACTIC CONSIDERATION 

In section 2, it was shown that the relative frequency of OV constnictions in Mandarin is 
8 98% among all transitive constructions, in the Min dialect it is 7.89%, and in the Yue dialect it 
is 6 59% These results give us a general idea of the distribution of OV constnictions in the three 
dialects, yet they cannot tell us whether the 8.98% of OV constructions in Mandann are Ae same 
type of constructions as the 7.89% of OV constructions m the Min dialect or the 6 59% of OV 
constructions in the Yue dialect. In literature, it is often mentioned (e.g. Zhang H. 1972, znang 
Z 1983 Yuan 1989) that the tsiang/tsueng constiiiction (one kind of OV constiiictions) m Mm 
and Yue is equivalent to the ha constiiiction in Mandarin. Here it is not clear to me whether 
'equivalent' refers to the structure of ba and tsiang/tsueng constinictions in the three dialects, or 
\hefitnctions of these consOiictions, or both. 

In this section, I will provide evidence and argue that all tsiang/tsueng consttuctions in Min 
and Yue are stiiicturally equivalent to ba constnictions in Mandarin, but not all constiuctions 
are stnicturally and functionally equivalent to tsiang/tsueng constnictions in Min and Yue As 
far as the other OV consti-uctions (with or without an object marker) are concerned, tiieir 
stiiictures and functions also differ in varying degrees among tiie tiiree dialects. Let us first look 
at the following example: 

1st obj.M det-cl. book tianslate-into Chinese asp. 
'I have translated this book into Chinese.' 

Sentence 4 has the stnicture: Subjcct+Object Maiicer+Object+Verb+Resultative complement. 
Textual analysis shows that if a verb carries a resultative complement, such as the one in 4, 
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translate into Chinese', its object has to be moved to preverbal position and has to be marked no 
matter in which dialect or witii which object marker.4 in this case, both structure and meaning 
ot the sentence m the three dialects are the same; the only difference is that Mandarin uses if. ba 
as the object marker, while Min and Yue use ^ tsiang/tsueng. Interestingly, when the 
postverbal complement is other type (e.g. directional complement, double objects), the situation 
among the three dialects is complicated and needs to be discussed case by case. 

Textual analysis reveals that all tsueng constructions in Yue can be converted to i& ba 
constructions m Mandann witiiout changing the meaning of the sentence. However the reverse 
situation IS not the same. That is, ba constructions in Mandarin are not always converted into 
tsueng constructions in Yue. Consider the following examples: 

5. Mandarin: a. ^^:je.^#JM|L^T ° 

he first obJ.M money deposit in bank 
'He deposit the money in the bank.' 

he first deposit money in bank 

6. Yue: a. 'fE#iiu6<;^X|(L:Jt^ o 
he deposit money in bank first 

; first deposit the money in die bank.' 



'He 



he obj.M money deposit in bank first 
'He first deposit the money in die bank.' 

Notice tiiat the ba construction in 5a can be converted into either 6a or 6b, both of which 
are grammatical yet their structures are different. 6a has SVO word order and 6b has SOV 
order m which the object is marked by tsueng. This seems to suggest that die flexibility of 
word order m Mandann and Yue is different. We see that the MandSin ba construction iS^ 5a 
n^nnnt Converted mto two types of Yue constructions in 6, but the tsueng construction in Yue 
stX yrnr^QQ?' 7?" two types of Mandarin constructions as shown in 5. Previous 
S PA^SioT i£ ; ?• f '^"'•y ^^^^^ this is because in Mandarin an 

?n thi ^^ PO?tverbal position when there is a postverbal phrasal complement 

m the sentence while m Yue it can. This is probably one of the reasons why the Relative 
constructions in Yue (6.59%) is somewhat lower than that in M^darin 

eaui^Senf fn'Ihf ^^"l assumption tiiat the ba construction in Mandarin is not always 

Sn7h. M ! ^ '"'''"^ construction m Mm is that many ba constructions, when converted 
mto die Mm dialect, may become OV constructions without an object maricer. For instance: 

7. Mandarin: a. #^&ni|ji4e >}l o 

you obj.M door close-up-come 
'You close die door.' 



r « 5 P?^"tei°"t this refers to the situation when die subject precedes the obiect 

( e. the soy word order). The example in 4 can also be converted into in OSV word orS 
all three dialects m which the object does not need to be marked 
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you door close-up-come 

8. Min: a. jj^lSH (IS^P-) W^3il o 
obj.M door close-up 
'You close the door.' 

b. mm{mp-)^M.^° 

you door close-up 
'You close the door.' 

The Mandarin ba construction in 7a can be converted into two constructions in Min, as 
shown in 8a and 8b: one with an object marked by tsiang, the other with an object unmarked. 
But in Mandarin, only one construction in which the object is marked is allowed. This provides 
evidence that the Mandarin ba construction in 7 can have two interpretations in Min, while the 
two variats in Min can have only one interpretation in Mandarin. Therefore, we cannot say that 
the ba construction in Mandarin is always equivalent to the tsiang construction in Min. 
Examples in 7-8 also provide evidence that objects in Min are more free to be moved to preverbal 
position than those in Mandarin. 

The results of a survey study of the difference between the two variants in 8 show that the 
two constructions differ in style and function: the one with a marked object is more formal than 
the one with an unmarked object; but the latter is more likely to have the function of emphasis 
tiian the former. 

We have seen that OV constructions with a marked object are not always equivalent to one 
another structurally in the three dialects. In the following, I will show that OV constructions 
without an object marker are not always equivalent to one another in the three dialects either. 
Consider the following examples: 

9. Min: a. 

he shirts wash-fmish can come 

'After he finishes wasWng the shirts, he will come.' 

b. ¥-'jL-f(f^it^ ° 

he wash shirts finish can come 

'After he finishes washing the shirts, he will come.' 

10. Yue: a. m^'^^r»&.it^ ° 

he shirts wash-finish can come 

b. it^^*fr^&.it^° 

he wash shirt asp. can come 

'After he finishes washing the shirts, he will come.' 

1 1 . Mandarin: a. ?^4C)i5Li5b^ 7 it^ ° 

he clothes wash-fmish can come 

b. ^^'i^l^^ki^^o 

he wash-finish clothes can come 

'After he finishes washing the clothes, he will come.' 

According to Yang (19^1), the OV order without an o^ect marker is preferred in the Min 
dialect when there is a sentential complement, such as ^ 'will come' in 9a. In Yue and 
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Mandarin, however, the OV order is NOT preferred because of the very same reason, as shown 
m 10a and 1 la; rather, the regular VO word order is likely to be used, as shown in 10b and 1 lb. 
I assume that this is associated with the principle of sequencing in Mandarin and Yue. To prove 
this, however, requires further investigation. 

To summarize, I have shown in this section that OV constructions with a marked or an 
unmarked object in the three dialects are not always structurally equivalent to one another. The 
results of this study suggest that the OV construction must be used in Mandarin when there is a 
postverbal phrasal complement in the sentence, but not in Yue. On the other hand, if there is a 
sentential complement, the OV construction is preferred in Min, however, this is not the case in 
Yue nor in Mandarin. Another way to state this is that the scope of OV order with an object 
marker is syntactically wider in Mandarin than in Yue, but the scope of OV order without an 
object marker is syntactically wider in Min than in Yue and Mandarin. 



3.2. SEMANTIC CONSIDERATION 

In addition to the syntactic factors discussed in the preceding section, the use of OV 
constructions (witii or witiiout an object marker) in the three dialects is also found to be affected 
by different semantic factors. In literature, it is often mentioned that tiie verb in tiie OV 
construction (with an object marker) expresses a 'disposal' action (Chao 1968, Zhang 1972, Li 
1974, Gao 1980, Li and Thompson 1981 among others) and has to convey a perfective or 
ttansitive meaning (Hopper and Thompson 1980, 1984; Mei 1981 ; Jin 1993). For instance: 

12. Mandarin: a. -f&IC^-f tfft 7 « 

he obj.M glass hit-break asp. 
'He broke the glass.' 



b. 



he obj.M glass hit 



c. 



he hit-break asp. glass 
'He broke the glass.' 



13. Min: 



a. 



I obj.M bowl-rice eat-fmish. 
'I ate a bowl of rice.' 



b. 




c. 




14. Yue: 



a. 



you want obj.M people frighten-die excl. 
'You want to frighten people to death?!' 



b. 



you want obj.M people frighten excl. 
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you want frighten-die people excl. 
'You want to firighten people to death?!' 

In 12-14, the verb has some kind of disposal meaning, or, rather, the object is somewhat 
affected by tJie action of the verb. In addition, all three sentences have a word or morpheme 
immediately after the verb expressing the perfective aspect of the action of the verb, i.e., 
tr^l hit-break, ^7 eat-fmish, -^^L frighten-death. Without the perfective morpheme, 
diese sentences would be unacceptable, as shown in 12b, 13b and 14b. This provides evidence 
that the vert) of OV constructions with an object marker needs to cany a perfective aspect and the 
object is affected by the action of the verb. However, no evidence suggests that when the verb 
conveys the meaning of disposal, OV order HAS to be used, because the basic VO order is 
perfectly fine, as shown in 12c, 13c, and 14c. The conditions that govern the choice of OV 
order or VO order were found to be associated with discourse pragmatic factors, which will be 
discussed in detail in 3.3. 

The results of the previous studies are informative in the semantic function of the verb in OV 
constructions with an object marker, ba or tsiang/tsueng. However, there has not been much 
study of the semantic consideration of the remaining OV constructions, i.e. those without an 
object marker or those with other object markers such as ^ lianAin, $^ tui, ka, so we do 
not know whether the verbs of the OV constructions without an object marker in the three 
dialects are the same as those OV constructions with an object marker, ba/tsiang. My own study 
(1993) of Mandarin texts and the results of this study show that the verb of OV constructions 
without an object marker or with some other object markers,^ ^ lianAin, fj- tui, ^ ka, do not 
always convey the meaning of disposal, as shown below: 

15. a. it^-^^M-^o 
this sentence I like 
'This sentence, I like' 

b. o 

he emph.M that sentence like 
'Even that sentence, he likes.' 

c. *^^(m)^^m-iko 

I obj.M this sentence like 

d. *W^U4-ik. o 
he that sentence like 

The sentence in l.-^a has an OSV structure with no object marker. Notice the verb 4.<j: 'like' 
in it is stative and does not have the disposal meaning, but it is grammatical. Adding the object 
marker i|. lion when the sentence is converted into an SOV order is also acceptable, as shown in 
15b; however, adding the object marker ba/tsiang/tsueng or leaving it bare makes it 
unacceptable, as shown in 15c and 15d. This is true of all three dialects. 

We see the semantics of verbs in OV order among the three dialects is more or less the same. 
That is, when the object is marked by ba or tsiang/tsueng, the verb is most likely to express the 



5'Object marker' here broadly refers to any markers used before objects, even though 
lion in Mandarin could well be an emphatic marker and |& ka in Min be a dative marker, etc. 
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meaning of disposal, or the object is mostly likely to be affected; when the object is unmarked or 
marked by another form, however, the verb does not have to have the disposal meaning. 

At this point, it becomes interesting to see whether objects and object markers in OV order of 
the three dialects have the same semantic function. As it is known, the prototypical semantic 
relationship between a verb and an object is that the verb projects an action to the object and the 
object functions as a receiver or patient of the projected action, so it is a give-take relationship 
between them. Object markers, on the other hand, do not normally have any semantic nor 
pragmatic function themselves, rather they only serve to distinguish the relationship between 
noun phrases and verbs (see Comrie 1981:117-121). Investigation of objects and object 
markers in the three dialects reveals that both objects and object markers do not always retain the 
prototypical functions mentioned above and sometimes differ from one another among the 
dialects. Consider the following examples: 

16. Min: ^M'^^f^P'^ik ° 

I obj.M foot step he/him two time 
'I stepped on him twice with my foot.' 

17. Mandarin: ^H^'^^^ftl o 

he obj.M child starve-die asp. 

'He caused (his) child to starve to death.' 

18. Yue: '(i5;^l^^*:iK«^«fL ° 

you obj.M he/him angry to blow-bear-stare-eye 

'You caused him to be angry to the extent that he frothed at his 

mouth and glared with rage.' 

In 16-18, the object marker is either ba or ^ tsiang/tsueng depending upon the dialect, 
and all noun phrases, 'foot', - 'child', and -(E 'him' "ter 'the object marker, are not 
semantically the receiver/patient of the actions of the verbs, ^jj- 'step', ^ 'starve', and Jlt 
'make.. .angry' respectively; instead, they are the receiver/patient of the object marking ba or 
tsiang/tsueng, serving an instrumental function as in 16 or a causative function as in 17 and 18. 
According to my previous study (1993:142), few ba constructions in Mandarin have the 
causative or instrumental function in modem texts, however, historically it is not uncommon for 
jiang to serve that function. In comparison, tsiang constructions in modem Min (e.g. Chaozhou 
dialect), according to Li (1959:256), OFTEN have the instrumental function, as shown in 16. 
This seems to suggest that the functional difference between ba in Mandarin and tsiang in Min is 
reflected in the process of ba and tsiang's grammaticalization. Ba in Mandarin represents its 
latest function (e.g. object marker) in the disposal form, while tsiang in Min represents the 
relatively old function (e.g. instrumental marker) of the disposal form in Chinese.^ If this is 
true, it helps us understand better why the relative frequency of the tsiang construction in Min 
(and possibly Yue) is lower than the ba construction in Mandarin. Logically, the chances to use 
them as object markers are far more frequent than as instrumental or causative markers. 

As far as the semantic function of the other object markers are concemed, it is common for 
if. lianAin and tui to be used as an emphatic marker and $^ gei/ka used as a dative marker. 
As this subject itself is big enough to write a separate paper, I will not discuss it in detail. 
Interested readers can consult Li (1959), Teng (1982), Xu (1990), and Cui (1993). 



^In fact, this is part of the reason why Hat jiang construction in Mandarin is rarely used in 
modem texts. Detailed discussion of this subject can be found in Xing (1993). 
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In this section, we have discussed the semantic function of verb, object, and object marker in 
OV constructions in the three dialects. Evidence shows that even though the verbs m OV 
constructions with an object marker, ba or tsiang/tsueng , in all three dialects are most likely to 
convey the meaning of disposal, the verbs in OV constructions without an object marker 
ba/tsiang/tsueng or with other object markers are not always the case; they can express a stative 
situation, such as A^fe 'like'. Other results suggest that the semantic function of ba and 
tsiang/tsueng phrases vary among the three dialects. Ba in Mandarin almost exclusively serves 
as an object marker, so the noun phrase immediately following it is almost always the 
receiver/patient of the action of the matrix verb. Tsiang in Min (maybe in Yue), on the other 
hand is often used as an instrumental or a causative marker, hence the noun phrase immediately 
following it is not the receiver/patient of the action of the matrix verb. I argued that this 
difference between Min/Yue and Mandarin represents the different functions of ba/tsiang/tsueng 
phrases in the course of their grammaticalization, which, in turn, explains the frequency 
difference among ba and tsiang/tsueng constructions in the three dialects. 



3.3. DISCOURSE PRAGMATIC CONSIDERATION 

We have seen that both syntax and semantics can affect the choice of word order in all of the 
three dialects. Apart from these two factors, this study also found that there are a number of 
discourse pragmatic factors which are essential in distinguishing word order vanants in the 
dialects studied. Hence, this section will focus on which discourse pragmatic factors are 
important and how they constrain the use of different word orders in the three dialects. 

Textual analysis shows that among other things, the motivation for using OV order rather 
than VO order in Mandarin, Min, and Yue is to put relatively important objects in preverbal 
position Now the question is how we can tell whether objects are important. In the following, 
I will provide evidence showing: 1) The objects in OV constructions with a subject tend to be 
definite and important. As definite nouns refer to entities that have been mentioned in previous 
discourse, coding tokens for definiteness helps us to identify which noun phrases are relatively 
more common and hence more important in discourse than others. 2) OV constructions without 
an object marker tend to have emphatic function: contrast/listing. Presumably, if an object is 
contrasted/listed, it is more important than other entities in the discourse. 



3.3.1. DEFINITENESS 

Definiteness is not a new term in the study of OV order in Chinese. In fact, it can be dated 
back to Mulie (1932) who first suggests that preverbal objects in Chinese are defimte. In the last 
two decades, two groups of researchers provide convincing supporting evidence for that 
suggestion: one group is Li and Thompson, and the other is Sun and Givdn. Li and Thompson 
(1974a, 1974b, 1975a, 1981) claim that SVO order characteristically codes indefinite objects and 
OV order codes definite objects; however, they do not give quantitative data supporting this 
claim. Sun and Giv6n's text coum (1985:336-337) indicates that 49% of the objects in VO order 
are indefinite, while roughly 80% of the objects marked by 'ba or 'jiang' are definite in OV 
order. Both Li and Thompson's and Sun and Givdn's studies reveal a general tendency 
regarding the definiteness of subjects and objects in the SVO and OV constructions. However, 
it is not clear in either Li and Thompson's or Sun and Giv6n's study whether OV order includes 
those constructions without a subject. In addition, all of these studies are based on Mandarin 
Chinese, not other dialects. 

To code definiteness in Chinese is not as easy as in English, since there is no definite article 
in Chinese corresponding to the English 'the' (see Givdn 1978, 1984 for the definition of 
definiteness in English). In Mandarin, some noun phrases (NP) have demonstrative modifiers. 
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zhe this and nei 'that', by which we know they are definite; however, other NPs do not have 
any overt marker to show whether they are definite or not, so they must be determined from 
^ntext. Xmg (1993:22) codes all NPs (subjects and objects) in the SVO and OV constructions 
(both wij and without an object marker and a subject) collected from short stories and novels as 
either defmite or indefinite by the following definition: 

i. It is mentioned in the preceding discourse and it is known to both 
speaker(s) and listener(s). 

ii. It is not mentioned in the preceding discourse, but it is known to both 
speaker(s) and listener(s), because it is uniquely identifiable. NPs, such 
as taiyang 'the sun' and yueliang ^ ^ 'the moon', belong to this 
category. 

iii. 'Inferrible' noun phrases, such as 'his head', 'my apartment', etc (see 
Prince 1981) 

The results of Xing's text counts show that subjects in all constructions are definite more 
than 85% of the time. Objects, however, do not always tend to be definite or indefinite in OV or 
VO order; rather, they are split by whether the OV construction has a subject and an object 
marker. That is, the object in the OV order with a subject and an object marker is much more 
likely to be definite (80%) than those in VO order and the OV order without a subject and an 
object marker (53% for both cases). Compared with Sun and Giv6n's findings (1985) and Li 
and Thompson's findings (e.g. 1981), Xing's results are more detailed and, therefore, more 
informative. 

Using the same defmition given by Xing above, ^ ja-.-e coded all NPs of OV constructions in 
the Mm and Yue texts studied for this paper as ei'j;cv def-nite or indefinite. The results are as 
tollows: the objects m the OV order with a subje^ i and an object marker in both dialects tend to 
be definite (86% m Min and 90% in Yue), while f*^ jse m the OV order without a subject and an 
object marker do not have that tendency. Notice that these results are the same as those found in 
Mandarin by Xmg (1993), even though the relative frequency of the OV order with an object 
marker m the three dialects varies as discussed earlier. In this case, I conclude that in Chinese 
OV order is likely to be used under two circumstances: 1) when there is a subject in the target 
sentence and the object is more important (e.g. definite) in discourse; and 2) when there is no 
subject m the target sentence, in which case the object does not have to be definite. A natural 
question to ask at this point is that if tiie object is not definite, why is OV order used, instead of 
tile regular VO order? The following section makes an effort to answer this question. 

3.3.2. EMPHATIC FUNCTION: CONTRAST AND LIST 

f .-5 literature, tiiere has not been much research comparing die discourse pragmatic functions 
ot ditterent word orders m Chinese, and even less research comparing the functions of difi^erent 
orders m tiie three dialects. However, tiierc have been a few studies of the functions of tiie OV 
?n7foV"A^, • i^ marked by ba/jiang (e.g. Sun and Giv6n 1985, Tsao 1987, Hu 

1993). A claim made by Sun and Giv6n (1985) tiiat most OV orders (with or without tiie 
object marker have the conti-astive/emphatic function. In this section, I will first argue that 
Sun and GivOn's claim is not tenable; tiiey over-generalize the function of OV orders in 

''It is not clear whether OV here refers to SOV, OSV or OV (without any subject), or all 
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Mandarin. Then, I will provide evidence of the constructions that do have contrast/emphatic 
function in the three dialects. 

Sun and Giv6n (1985), to my knowledge, are the first to discuss the correlation between a 
certain discourse function and tlie use of a certain construction in Chinese. The main focus of 
their study is to investigate the discourse function of different word orders in Mandarin Chinese 
based on recorded speech and written texts. One of the claims they make is that 'the functional 
distribution of the OV construction, both with and without the OM (object maricer=M), strongly 
tags it as a marked, specialized, contrastive/emphatic discourse device' (1985:348). This claim 
poses a number of problems. 

First of all. Sun and Giv6n, like other researchers (Kuno 1973, Chafe 1976, Giv6n 1984 
among others), do not define 'contrast'. They use a text count method called Potential 
Referential Interference (PRT) to diagnose the emphatic/contrastive nature of constructions. The 
problem with this approach is that the PRI measure does not seem to be objectively applicable to 
naturally occurring data (see detailed critique of this matter in Myhill and Xing 1993). The 
second problem is that their claim for the correlation between the function of contrast and OV 
constructions is overstated in that we can easily find counter-examples in any naturally occurring 
texts. Consider the following examples: 

19. ^X^^^f^ -J o (Chen Rong, p. 349) 
passport issue come asp. 

'My passport has been approved.' 

20. n Pfl 7 o {Honglou Meng, p. 168) 
you just obj.M garden door close asp. 

'You go and close the door of the garden.' 

Both 19 and 20 are OV constructions; the difference is that one has an object marker and the 
other does not. Sentence in 19 is used when a person tells her best friend that her passport has 
been approved and she is leaving in a few days. In this case, there is no constituent, either in the 
preceding or the following discourse, with which the NP ^Jg. 'passport' or the verb ^ 
'approve' is contrasted. The speaker is simply telling her friend a fact. Hence this OV 
construction is not contrastive. This is also true for example 20, which is used when a man 
orders one of his servants to close the door of a garden. In the surrounding context, we do not 
see any other NPs compared with ® 'door of a garden', nor any verb compared with Ui 
'close', so that sentence 20 is not used contrastively either. If Sun and Giv6n consider both l9 
and 20 as contrastive constructions, it is not surprising to see that their results support their claim 
(1985:348) that among other things the ba construction is almost always used contrastively. 

Aware of the problem with Sun and Giv6n's study, I propose an alternative analysis. That 
is, in Mandarin the OV order with an object marker ba does not have the emphatic function: 
contrast/listing; only the OV order without an object marker ba does. The latter type of OV 
orders includes (/ian)-OSV, S-Zian-OV, and OV constructions. The Min and Yue dialects, 
furthermore, exhibit a similar pattern: OV constructions with the object marker tsiang/tsueng do 
not have the contrast/listing function, while the remaining OV constructions often do. In the 
following, I will begin by defining the terms 'contrast' 'listing'. « Then evidence for the 
correlation of the contrast and listing function and the use of the OV construction in the three 
dialects will be given. 



^In this paper, I will not define the term 'emphasis' because it seems to me that emphasis 
is a general term which can be specified as 'contrast' and 'list'. 
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Definition. 'Contrast' and 'listing' are two distinct discourse devices, which have similar 
functional domains, as illustrated in 21 and 22. 



Both 21 and 22 have the function of emphasizing two things, the former by contrasting, the 
latter by listing. This is probably why some linguists (e.g. Givdn 1988) do not make 
distinctions between 'contrast' and 'listing'. In literature, the term 'contrast' has often been 
discussed and applied to different languages. It is known that different languages have different 
means of expressing such a function. For instance, some languages (e.g. English) can express 
contrast either by intonation, i.e. high pitch (Chafe 1976), or by using different word order 
(Givdn 1984), while other languages, like Japanese and Korean, use a particle, [wa and (n)un 
respectively], to indicate 'contrast' (Kuno 1973, Hong 1985, Downing 1987, Hinds 1987, 
Hook 1987, Ueno 1987, Isoe 1992). 'Listing', on the other hand, is rarely mentioned in 
literature: only Schiffrin (1992), to my kno>yledge, talks about 'listing' in recorded 
conversations in English and makes an effort to categorize the items in a list. 

All of these studies are valuable in working out the cross-linguistic pattern of contrast and 
listing functions. However, when linguists (Kuno 1973, Chafe 1976, Giv6n 1984 among 
others) discuss 'contrast', they either do not define this term at all (e.g. Kuno) using isolated 
examples to illustrate their points, or they define it so vaguely that it is impossible to apply to real 
data. For instance, an example that Chafe (1976) uses is 'Ronald made the hamburgers'. 
According to him, this sentence can represent either 'contrast' (similar to my 'liiting') or 'double 
contrast' (similar to my contrast) depending upon which element receives a I'igher pitch: If 
RONALD receives the highest pitch and stress and the reminder of the sentence ii- low pitched, it 
is 'contrastive'. He states that: 

'What is conveyed by such a sentence is the speaker's knowledge that Ronald, as 
opposed to other possible candidates the addressee might have had in mind, is the 
right selection for this role.' 

If both RONALD and HAMBURGERS receive high pitch. Chafe (1976:35) explains: 

'...it is the pairing of these candidates for these roles that is being asserted. That 
is, if we are to take possible pairings of agents with patients of MAKE in this 
particular situation, one of the correct pairings (the speaker asserts) is RONALD 
with the HAMBURGERS. (Perhaps SALLY made the SALAD but RONALD made the 
HAMBURGERS.)' 

This seems extremely difficult or impossible to apply to other languages in which intonation and 
pairing patterns are different from English. However, it does give us some idea of possible 
contrastive expressions in English. With certain modifications, it should be applicable to other 
languages as well. 



21. 



1fWlf7^^^»#TBflTt«.o (Neican, p. 10) 
name turn off asp tape recorder, turn on asp television. 
'Fu turned off the tape recorder and turned on the television.' 



22. 
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Myhill and Xing (1992) make a preliminary effort in this respect. The definition they offer 
for 'contrast' generally corresponds to what Chafe refers to as 'double contrast'. They differ 
from Chafe in that they give further detailed descriptions of the properties of contrasted/listed 
entities (e.g. NPs and Vs) in a given construction They point out that it is necessary that 
contrasted or listed NPs refer to entities which are elements of a SET^ and the verbs in such a 
contrasted or listed pair must have either essentially the OPPOSITE meaning or essentially the 
SAME meaning. In addition, they discuss different types of contrast, such as 'verbal/non-verbal 
contrast' and 'implicit/explicit contrast'. Verbal contrast refers to contrastive constructions 
which have their verbs contrasted, as illustrated in 21, while non-verbal contrast refers to 
constructions which have verbs with the same meaning but have their agents, patients, themes, 
etc., contrasted, as illustrated in 23: 



Explicit contrast refers to cases where both constructions in a pair are overt, as in the contrast 
example given in 21. Implicit contrast, on the other hand, refers to pairs which have only one 
construction overt, as illustrated in 24, and they often have words such as 'even', 
'air, ^ 'also, too', 'only', etc., as a signal for contrast or listing. 



In 24, the OV construction without an object marker (in the second clause) is an implicit 
contrastive construction. According to the context, the sentence can be interpreted as '(Before 
you entered the university) your face was often seen, but after you entered the university, your 
face is hardly seen anymore', so what is implicitly contrasted is that your face 'is seen' vs. 'is 
not seen'. 

The major difference between 'contrast' and 'listing' is then as follows: In a pair of verbal 
contrastive sentences, ONE or more non-verbal constituents are in a set relationship, and the 
verbs are OPPOSITE in meaning, as exemplified in 21, in which two non-verbal constituents 
form a set, 'tape-recorder' vs. 'television', and the two verbs are opposite in meaning, 'turn off 
vs. 'turn on'. In a pair of non-verbal contrastive sentences, TWO or more non-verbal 
constituents form a set and the verbs are the SAME in meaning, as exemplified in 23. In a pair of 
listing constructions, the verbs must be either identical or have essentially identical meanings and 
only ONE pair of non-verbal constituents are in a set relationship, as exemplified in 22. 

► The current study basically follows Myhill and Xing in defining the terms 'contrast' and 
'listing', except that I have modified the cases in which a set may be constituted so that the terms 
may be easier to apply to cross- linguistic phenomena. In the following, the revised criteria for a 
set are given: 

(1) Any pair of elements which are represented as complementary parts of a 
whole (e.g. this vs. that; East vs. West) can constitute a set. 



^he term 'set' was first introduced in analyzing word order variation by Ward (1985). 



23. MifL-^m^^^l 7 aHF a ^ t ° (Neican, p. 158) 

name one cl. money stuff to asp waiter hand middle 




'Kang stuffed a wad of money into one of the waiter's hands, 
Lu stuffed some money into his other hand.' 



24 i^;^ 7 , ^^IL:f ^ 7 ° m<^^n, p. 9) 

you study asp. college face all see neg. get asp. 
'Since you entered the college, your face is hardly seen.' 
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(2) Any pair of elements which belong to the same semantic domain (e.g. musical 
instruments: piano vs. violin, or siblings: brother vs. sister) can constitute a set. 

(3) Any pair of elements which generally do not belong to the same semantic 
domain, e.g. money vs. food, but which under specific circumstances function 
as if they do (when both are valuable things or necessary to life) can be 
considered as a set. 

(4) Any two or more people who live, travel, talk, etc., together can constitute a 
set, while they are together. 

(5) Any pair of elements having a parallel relationship to the members of the set 
(e.g. the daughters of two people who live, travel, talk, etc. together) can 
constitute a set as well. 

Given these five criteria, along with the basic property of 'contrast' (e.g. to contrast two or more 
verbs or nonverbal entities in a set relationship) and 'listing' (e.g. to list two or more entities in a 
set relationship), one should be able to code 'contrast' and 'listing' constructions from any text 
in any language. 

Correlation. Given the above definition, the OV constructions in the three dialects have 
been coded as either contrast/listing or non-contrast/listing. The results are given below: 

Category VO OV w/o ba/tsiang OW w/ ba/tsimg 

Mandarin unlikely likely unlikely 

Min unlikely likely unlikely 

Yue unlikely likely unlikely 

Table 2: Likelihood of the contrast/listing function in the three dialects 

*0V w/o ba/tsiang includes SOV, S-i^/IJ-Zlfe-OV, OSV, and OV constructions. 

Table 2 shows the likelihood of the contrast/listing function in the three dialects. From the 
table, we see that in all three dialects OV orders without the object marker ba or tsiang/tsueng are 
likely to have the contrast or listing function, but the other constructions are not. One may have 
realized that the OV order without a ba or tsiang/tsueng varies slightly among the dialects, so the 
contrastive and listing OV construction should also vary among the three dialects. This is true; 
however, it does not affect the general tendency of OV orders without an object marker being 
used contrastively. Evidence for this is that Mandarin and Yue often use lianAin to express 
implicit contrast, while Min tends to use tui to convey the similar meaning, as illustrated 
below: 

25. Mandarin: . . . it-f-#^6^jtl3jl o {Chen Rong, ^352) 

even hand raise-not-up-come 
'...even her hand cannot be raised.' 

26. Yue: ^^%tf(^^^X^^. , . . . (Gao Huanian, p. 225) 

even safe key he give to you 

'Even the safe's key, he gave to you, ...' 

27. Min: ifc 3jL o (Li Yongming, p. 256) 

even its head cut-out-come 
'Even its head was cut off.' 
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The object marker it, / in 25-27 sometimes can be omitted and the remaining sentences 
still have the contrast or listing function. This is especially true of spoken discourse when extra- 
linguistic marks, such as high pitch, are added to the sentences. This supports the assumption 
that OV constructions without an object markers are associated with contrast/listing functions. 

It should be made clear that 'likely' in Table 2 means that certain types of OV constructions 
are GENERALLY associated with the contrast/listing function, although they are not ALWAYS 
used in that way and it is certainly not the case that all OV constructions with the object marker 
ba are associated with the contrastive function, as claimed by Sun and Giv6n (1985). The non- 
contrast/listing OV construction without an object marker ba/tsiang/tsueng and a subject is found 
to have the function of demoting agent and/or promoting patient. Consider the following 
examples: 

28. Mandarin: 1 o 

college poss. big-door also lock asp. 

'The university's entrance door is also locked.' 

29. Min: if^M.^ ° 

shirt collect-up-come. 

'The shirts should be put in order.' 

30. Yue: '??.^6^^>t#'*§.^ ° 

these house all build-finish asp. 
'All these houses have been built.' 

All of the three sentences given above do not have an agent/subject, nor an object marker, 
nor the contrast/listing function. The OV order^o is used for one or more of the following 
reasons: 1) to leave out the subject/agent either because they are not important in the discourse or 
the speaker/writer does not know who the agent/subject is (e.g. examples 28 and 30), or the 
speaker/writer knows the agent/subject, but does not want to mention it explicitly (e.g. example 
29); 2) to put the patient/object in a relatively more important position (e.g. examples 29 and 30). 
Notice that all these reasons are closely associated with the pragmatic functions of prototypical 
passive constructions (see Shibatani 1985, 1988; Comrie 1988; Giv6n 1990), yet all the three 
constructions in 28-30 do not have any passive markers, such as ^ bei. Because of this 
discrepancy of form and function, some researchers treat them as passives (e.g. Zhang Zhigong 
1956, Wang Li 1957b, Gao 1980), others consider them as disposal forms (e.g. Yuan 19^9, 
Mei 1990). I take the stand of the first group of researchers because of my belief that all 
linguistic forms serve for functions, not vice versa; so if an OV construction without a 
subject/agent and an object marker has the function of passive constructions, it is a passive 
construction. I conclude that in the three dialects when OV orders do not have an object marker 
ba or tsiang/tsueng, they serve, at least, one of the two purposes: emphasis or promoting 
patients and/or demoting agents. 



lOAs mentioned earlier in footnote 3, for the convenience of readers the term 'OV is 
used to refer to the constructions as in 28-30 throughout this paper. Theoretically, it is not 
appropriate to use that term, because if the NP before the verb is the object, where is the subject? 
I use the term 'Patient- Verb' instead to describe the construction concerned in my dissertation. 
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4. CONCLUSION 



We have discussed the conditions of word order variation in Mandarin, Min and Yue dialects 
from three aspects: syntactic, semantic, and discourse pragmatic. Evidence shows that the 
syntactic and semantic constraints on the flexibility of OV order in the three dialects vary, 
however, the discourse pragmatic motivations for using OV constructions in the three dialects 
are more or less the same. 

The major syntactic constraint on the use of OV order in the three dialects lies in tlie existence 
of postverbal complements: in Mandarin, OV order, most likely with the object marker ba must 
be used when there is a postverbal phrasal complement, but this is not true of the Yue dialect 
FurtheiTOore, when there is a sentential complement, the Min dialect prefers OV order, while the 
other two dialects do not. As a result of these differences, some OV constructions WITH an 
object maiicer in Mandarin become OV constructions WITHOUT an object marker when converted 
into the other two dialects and vice versa. I argued that these syntactic differences among the 
three dialects trigger some of their semantic differences. The OV construction with the object 
marking ba in Mandarin, the so-called disposal form, is not always equivalent to the OV 
construction with the object marker tsiang/tsueng in Min and Yue. They differ both in function 
and style: Ba in Mandarin is almost always used as a grammatical marker witiiout any semantic 
implication, but tsiang in Min is often used as an instrumental marker. Stylistically 
tsueng/tsiang constructions in Min and Yue are more formal in discourse than tiie ba construction 
in Mandann. I suggested that these semantic and stylistic .differences between ba and 
tsiang/tsueng represent different facets of ba and tsiang/tsueng in the course of tiieir 
granmiaticalization. 

Interestingly, all tiiose syntactic and semantic differences do not affect the similarity of the 
di«:ourse pragmatic function of OV constructions in the three dialects. \Ve hrve found that OV 
orders without an object marker ba or tsiang/tsueng in all three dialects tend to iiave die emphatic 
funcuon: contrast or listing, but tiiose witii tiie object marker ba or tsiang/tsueng do not have tiiat 
tendency Further study of the non-emphatic OV constiiictions without a subject and the object 
markmg ba or tsiang/tsueng reveals tiiat they are used to demote agents and/or promote patients. 

All these results are instructive and helpful in understanding the function and typology of 
word orders in Chinese. They provide language specific evidence regarding tiie motivations for 
using oy constructions in the three dialects. Future research can compare the findings of tiiis 
study with those from studies of other languages so as to develop a theory of this aspect of 
umversal grammar. y ^ 
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ASPECTS OF PROSODY IN MANDARIN DISCOURSE 



Li-chiung Yang 

Department of Linguistics 
Georgetown University 

INTRODUCTION 

The importance of intonation in language is gradually becoming more recognized among 
linguists, psychologists, and cognitive scientists, and the notion that intonation is comparable to 
syntax, phonology, and semantics in its functions in language is now being considered. A clearer 
understanding of intonation and how it is affected by, affects, or substitutes for different 
syntactic structures, and how intonation can modify or even reverse the surface semantic 
meanings of speech, will help resolve^d fill in many gaps in current linguistic theory. 

Intonation is an essential component of meaning. It is important to speech because it helps 
to make things distinguishable. Intonation conveys differences among different mental or 
psychological states, speech acts, and semantic concepts. In normal speech, variations in 
cognitive states, focus, and in signals of relative importance are critical to successful 
communication. In conversation, we are not only interested in the facts of topics, but also have 
a critical interest in the judgements and attitudes of the other conversational participants. Because 
of its multi-dimensional nature, intonation provides an efficient way to express complex 
meanings and ever-changing variations in attitudes and states, and is therefore efficient in the 
transmission of differentiated information. 

Recent research has also shown that many characteristics of intonation are similar across 
languages (Bolinger, 1989; Fonagy, 1987; Femald, 1991). The expression of emotional and 
cognitive states through intonation is ever-present throughout language, and may have underlying 
bases of evolutionary adaptation, speech mechanism structure, and neurological abilities and 
constraints. A strong case for a universal basis for intonation would be considerably enhanced 
if patterns of intonation in a tone language such as Chinese could be shown to share similar 
intonational characteristics with non-tonal languages. 

RESEARCH PROCEDURES 

For this study, I recorded two sets of spontaneous natural discourse data in home settings 
between native speakers of Chinese. Each recorded conversation is about 3 hours in duration, 
totaling 6 hours of speech altogether. All of the informants are female speakers from Taiwan. 
Speech data were first analyzed perceptually, and then about 60 minutes of the speech data were 
digitized and analyzed acoustically using the WAVES speech software at the Phonetics Lab of 
Stanford University. 

My approach differs from previous research on intonation in that I take an integrated 
approach of analyzing intonation from a broader perspective of cognitive elements and discourse 
structure. Previous research has tended to concentrate on read speech and short sentences 
constructed in an experimental setting. These types of studies often provide interesting but 
isolated results. In real-life speech all of the elements of prosody are interwoven together. 
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Artificially separating and isolating just one parameter misses the relationships among all of 
these elements, and may distort the actual influence of even one single variable. This often 
results in oversimplification and over-generalization of research findings. In addition, there may 
be artifacts arising from the artificial production situation. 

In spontaneous discourse, there are frequent false starts, hesitations, repetitions, and other 
common speech production phenomena. Relationships among speakers are reflected in complex 
patterns of participant interactions and signalling of cognitive states. The development of the 
discourse itself undergoes rapid changes and frequent topic shifts, and rhythmic and emotional 
elements are very common. All of these things are reflected in the intonational patterns of 
natural discourse. Many intonational phenomena are likely to occur only in spontaneous speech, 
and are very difficult to imagine without looking at natural discourse itself. Because of these 
considerations, it is crucial to use natural spontaneous discourse data in order to gain a full 
understanding of how intonation functions in discourse. 

In this paper, our discussion is concentrated on three aspects of intonation in Mandarin: 
Focus, cognitive-affective states, and discourse phenomena. 

CHINESE TONAL SYSTEM 

Mandarin Chinese has 4 lexical tones and a neutral tone. Each syllable is associated with a 
specified tone and pitch difference is used to distinguish meaning. The standard Chinese tonal 
system and pitch values (Chao, 1968) are: 

Tone 1 " high and level 55 

Tone 2 ' rising 35 

Tone 3 " fall-rise 214 

Tone 4 ^ falling 51 

* Tone 0 variable 

ASPECTS OF INTONATION 

Focus and Non-focus 

Focus or stress is related to the importance of topic and is often used as a form of cognitive 
emphasis. In discourse we often want to signal the judgement of importance which we attach to 
specific linguistic units, and we stress according to the relative importance of that unit in the 
discourse hierarchy. 

In my data, I have found that focus in Chinese is often signalled by using full tone contour 
and by expanded pitch range. This is in agreement with Chao's observations (Chao, 1968). In 
In speech, however, pitch, amplitude and duration are often coupled together and therefore 
Ibcused words often have a higher amplitude ano lengthent d duration as well. Moreover, focus 
is often accomplished by specific rhythmic patterns and voice quality changes. 

In Chinese, focus is manifested differentiy with different tones. With focus, specific 
distinguishing characteristics of tones are often exaggerated to achieve prominence. For example, 
focus is often achieved in 1st tone words by a prolongation or su .tainedness of the syllable. For 
3rd tone words, focus is commonly expressed by lowness or by dipping lower in pitch, while 
4th tone focused words are characterized by a large pitch range and downward force. 
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Fcx5us-signalling is related to semantics and cognitive processing of new and old information. 
In my data, infrequently used or newly encountered words tend to receive emphasis to alert or 
direct the hearer's attention to an unusual item. Systems of focus and non-focus are cognitively 
more efficient since they allow the hearer to alternate between periods of high attention and 
relative relaxation. 

In contrast to emphasized words, deemphasized words tend to lose their lexical tone 
structure, and are often said at a faster rate due to their unimportant states. These unimportant 
words often change their shapes because of interpolation and accommodation to neighboring 
syllables and the surrounding environment. Sometimes non-focused words have a specific 
discourse function, therefore take on a specific intonation appropriate to that discourse function. 

Even focused words can change their form under specific emotional influences. In this 
example [you3 oO hao3 duol oO], 1st tone duol in the phrase hao3 duol oO "a whole lot" takes 
on a dramatic arched rise-fall form because of the exaggerated emphasis. This arched rise-fall 
shape is typical of situations of exaggeration. 




Figure 1: you3 oO hao3 duol oO 
[Yeah, a whole lot] 

Focus and non-focus are essentially iconic in nature. That is, we often spend more time on 
things that are important to us and spend less time on unimportant things. For this reason 
subordinate clauses and parentheticals are often said at a faster rate and a lower pitch level. In 
the example of Figure 2: [bu2guo4 tal Ii3mian4 you3 sanlshi2 ji3geO college] "But they have 
over 30 colleges", 1st tone sanl changes its tone shape dramatically from a level pitch to a 
strong fall-rise shape due to emphasis on the large number of colleges. Within the phrase 
sanlshi2 jiSgeO, sanl has the most emphasis and so is the longest, and the subsequent less 
emphasized syllables are progressively shorter. The emphasis in this case is also strengthened 
by the consonantal lengthening of the fricative sounds in both sanl and shi2. In my data, it is 
found that speakers commonly use lengthening of the fricative consonant to express emotional 
intensification. In these cases, the length of the consonant commonly takes up more than one-half 
of the total syllable duration. 
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Figure 2: bu2guo4 talmenO Ii3mian4 you3 sanlshi2 jiSgeO college oO 
[But they have over thirty colleges Oh] 

Elements of prosody are often interwoven together. In the example of Figure 3: [ke3shi4 
mei2you3 Ieng3qi4] "But there wasn't any air conditioning", focus is achieved by modifications 
in duration, pitch range, and amplitude. The pitch level of ke3shi4 mei2you3 "but there wasn't 
any" is high and the pitch range is narrow. These words are relatively unimportant and are said 
at a faster speed. Leng3qi4 "air conditioning" is emphasized and so expanded in duration and 
pitch range. The steep drops in pitch exaggerate and emphasize the lowness of 3rd tone lengS 
and fall of 4th tone qi4. The lengthening and pitch drops in Ieng3qi4 also contribute to the 
complaining impression of this example. 




Figure 3: ke3shi4 mci2you3 Ieng3qi4 

[But they didn't have air-conditioning] 
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Cognitive and Affective States 

Certain cognitive states tend to have certain systematic influences and cause specific 
variations of the shapes of intonation through modifications to pitch shape and height, amplitude, 
and duration. Variations in pitch slope and direction are significant in contributing to the overall 
discourse interpretation. In Chinese, syllables with a convex pitch shape are commonly perceived 
as more harsh. Concave pitch shapes commonly give a softer impression. Pitch level and 
rhythmic tempo are also used for specific discourse effects. For example, a raised pitch form 
is often used to refer to things at a distance, either physically not in the immediate area, or 
metaphorically far away, as in reference to a prior topic. 

On the whole, pitch raising and lengthening are associated with doubt, uncertainty, 
continuation, and hesitation, while pitch lowering is often associated with certainty, definiteness, 
finality, authoritativeness, and negativity. The degree of tentativeness or definiteness is often 
correlated with the steepness of pitch slope. These lowering and raising effects are often strong 
enough to change citation high level 1st tone to become similar to falling 4th tone contour, and 
4th tone to a high 1st or rising 2nd tone shape. 

Figure 4 is an example to illustrate the effect of negativity on tones. Here the speaker is 
describing the appearance of mummies: [xiang4 ne4ge0 rou4ganl]. The strong negative feeling 
causes 1st tone ganl to take on a falling 4th tone shape, and the entire word rou4ganl is said 
low in pitch. Negativity is often accompanied by a choppy, broken rhythm, expressed in this 
example by the clipped speaking rhythm of rou4ganl. Surprise, on the other hand, is often 
characterized by a high pitch level, a strong rise-fall shape, and a steep pitch slope, as can be 
seen in the same chart. In this example, speaker B's surprised response "owaa" has the typical 
pitch shape of surprise. 
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Figure 4: xiang4 ne4ge0 rou4gan 1 owaaa 
[Looks like dried meat Wow] 

Hesitation and uncertainty are often accompanied with rising pitch levels, and lengthening 
of syllables. The rising form expresses the questioning uncertainty state, and the lengthening 



o BEST COPY AV^aLl4i.;LL 



-435- 

44b 



gives the speaker time to continue. An example of hesitation can be seen in Figure 5: [ran2hou4 
ne4ge0 jiu4shi4]. Because of the associated uncertainty, the 4th tone jiu4 changes to a rising 
pitch form while geO is greatly lengthened due to hesitation. The rising pitch form of ran2hou4 
in this example also reflects the uncertainty state. In this case, 4th tone hou4 changes 
dramatically from a falling shape to a rise-level shape. 
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Figure 5: ran2hou4 ne4ge0 jiu4shi4 
[Then that just is] 

In Figures 6 and 7, we can see how variations in pitch slope give rise to different 
perceptions of cognitive state. In Figure 6, the concave shape of dui4 "right" accompanies a soft, 
gentle and prolonged agreement with the main speaker. By contrast, the mainly convex and steep 
downward slope of dui4 in Figure 7 is associated with a strong emphatic and resolute agreement 
with the other speaker 
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Figure 6: dian4 shen2meO doul hen3 zao3 doul guanl dui4 [Right] 
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Figure 7: yinlwei4 zhao4 xue2xiao4 ne4ge0 biao3ge2 zhi3you3 yildianSdianS dui4 [Right] 
Discourse Markers 

In discourse, topic focus and the interest of participants are constantly changing, and many 
aspects of speech are involved in cooperatively coordinating the flow of topics and interests 
between participants. One class of words which are frequently used in this cooperative 
coordination are discourse markers. Discourse markers are words or phrases which are 
commonly used to signal the- relationship between discourse units. They include words such as 
suo3yi3 "so", yinlwei4 "because", ke3shi4 "but", dan4shi4 "but", ran2hou4 "then" and jiu4shi4 
"that's just". Discourse markers guide the conversation along so it can proceed in a more unified 
smooth manner. They are important because they often signal new things or that something 
important is coming up, and they signal the relationships of topic flow and the nature of each 
connecting relationship. 

Analysis of the data shows that discourse markers such as ranhou perform at least three 
principle functions. They signal the flow of topic by acting as indicators of relationships between 
phrases and subtopics. Discourse markers also act as signals between interacting participants to 
control floor-negotiations such as floor holding and turn-taking, and are often used as a strategy 
to gain time to recall or organize what to say next. Thirdly, they function as expressions of 
cognitive-affective states. Because of these considerations, discourse markers tend to express a 
lot of emotion, and therefore carry a lot of intonation. 

In my corpus, the discourse marker ranhou was used quite frequently, especially in narrative 
speech, to connect both temporal sequences and event sequences, i.e. narratives which follow 
a natural or logical development. By focusing on ranhou in different discourse contexts we can 
glean an understanding of how intonation performs its cognitive and discourse functions. 

In general, ranhou with a narrow pitch range was found to indicate continuing topic 
development from the immediately preceding phrase while ranhou with a larger pitch range tends 
to signal a return to a previous topic, usually after an intervening sub-topic. In the example of 
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Figure 8, the shape of ranhou matches well with the lexical rise-fall tonal pattern. The pitch 
range is relatively high and ranhou is perceptually prominent both from its pitch range and 
amplitude. The prominent ranhou here signals the break from the immediately preceding topic. 

JfcO 
340 

120 

,100 

2-10 
22(i 
200 
ISO 
160 
MO 
120 
100 

BO 

60 

40 

B.6 9. 6 9 9..' 9.4 9.6 9.6 10 10..' 

Figure 8: ran2hou4 wo3mO jiu4 qing3 ren2 lai2 lu4yinl maO 
[Then we asked people to come to record] 




In the following phrase, as seen in Figure 9, ranhou has a rise-level contour in a narrow 
pitch range. This phrase is a natural continuation of the preceding phrase, and this is reflected 
in the moderate pitch height and narrow range of ranhou. The change in kou4 from a falling 
form to a level or slightly rising contour also expresses continuation and some uncertainty. We 
can see that yi3hou4 in ttiis example also reflects some uncertainty and this is expressed in the 
level ending following the initial fall in hou4. 
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Figure 9: ran2hou4 lu4yinl leO yi3hou4 [Then after we did the recording] 
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By the following phrase, Figure 10, the speaker has successfully recalled a relevant piece 
of information and is more confident of what to say, and this is reflected in the downward pitch 
slopes of both 2nd tone ran and 4th tone hou, although the level ending on this hou4 suggests 
some uncertainty is still present. 
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Figure 10: ran2hou4 jiu4 ba3 ne4ge0 Iu4yinldai4 
[Then we took the tape] 

The contour of ranhou in Figure 11 illusL'^tes what often occurs in competitive floor- 
negotiations between participants. 
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Figure 11: wo3 zhei4ci4 hui2 - ran2hou4 ran2hou4 ni3 jiu4 ni3 jiu4 ke3yi3 qu4 
[This time I - Then then you can just you can just go] 
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My data show that in interruption situations, speakers often respond by using discourse 
markers to hold the floor with loud but relatively empty-content words. In Figure 11, speaker 
A tries to interrupt by saying "Wo3 zhei4ci4-" meaning "This time Speaker B immediately 
reacts by repeating ranhou two times with a hi^h amplitude and expanded pitch range. Note that 
thftfe is a 50Hz pitch drop between the first and second ranhou. This reduplication and the 
down-stepping seen here are common in this type of case, because the speaker will first try to 
hold the floor, then return to a more normal pitch level to resume. 

In the example of Figure 12, the speaker is speaking in a slow and careful manner. The 
pitch range of ranhou is expanded, and amplitude is high. Ranhou is prominent in this case 
because it is being used to signal the contrast between the "Chinese input" of the previous phrase 
and the "English output" of this phrase. In this sense, it shares with anaphoric reference a 
contrasting or tuming-away from the previous phrase. 
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Figure 12: ranhou shi English output 
[Then the English output] 



CONCLUSION 

Analysis of discourse data shows that tones and intonation interact in systematic ways so that 
participants successfully communicate the many levels of meaning present in discourse. 
Spontaneous natural discourse is a complex process encompassing cognitive relations, 
interrelationships among people, and emotions and judgements on discourse topics, and 
intonation is of critical importance in expressing this complexity. Many elements of intonation 
are fundamentally iconic in nature, because the relationships of the forms of sound mirror the 
internal connections which exist between physiology, cognition, and emotion. This is where 
intonation achieves its full forcefulness in all languages. 
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Introduction 

In the past decades, the Formosan languages have been investigated more or less 
extensively. Though the phonology of most of these languages has been well studied, 
the syntax of the vast majority, not to talk about the variations that divide the dialects 
of each language family, is still poorly understood'. As a consequence, the 
subrelationships of the Formosan languages with respect to one another and to the 
languages spoken outside Taiwan are still controversial. 

The present paper being part of on-going research on the grammatical typology of 
the Formosan languages, we do not intend to discuss here the internal relationships of 
these languages or try to reconstruct the proto language from which they are issued. 
Our aim is to present a typological overview of the temporal/aspectual and modal 
systems of five of these languages (Atayal, Bunun, Rukai, Saisiyat, Tsou), which have 
been selected both for their geographic dispersion and syntactic diversity. 

Geographically, these languages stretch from north to south and east to west. 
Atayal is the most widely spread language: it is found in the northern (Ilan, Taipei, 
and Taoyuan counties), north-western (Hsinchu, Miaoli and Taichung counties) - 
where Saisiyat is still spoken by a small community - central (Nantou county) and 
eastern (Hualien county) portion of the island. Bunun is located in Central and 
Southern Taiwan. Tsou is spoken in Southern Taiwan, in the mount Ali area, and 
Rukai stretches through the South (Taitung, Pingtung and Kaoshiung counties). All 
these languages have various dialects^ which may differ only phonologically (e.g. 
Tsou, Saisiyat) or diverge also syntactically (e.g. Rukai, Atayal, Bunun). 

'. If not mentioned otherwise, the data presented in this paper comes from our own field work 
(published or unpublished); it was collected between 1988 and 1994 with the financial support of (i) the 
National Science Council (NSC grants 77-0301 -H003- 14, 78-0301 -H003-19, 80-03O1-HOO3-O1 on 
Atayal, NSC grant 83-030 1-K003-0 17 on the "Typology of grammatical relations in some Formosan 
languages"), (ii) the local govemmet\t of Kaoshiung Prefecture on Bunun, and (iii) the Pacific Cultural 
Foundation on Rukai. 

2. Because of limitations of both space and time, we have been obliged to select from each language a 
representative dialect on which we could base our discussion. Our selection was made on the basis of 
the following factors; (i) the data was available to us; (ii) the dialect in question was found to be the 
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Syntactically, all these languages are basically verb-initiall The postverbal order of 
the NP arguments is more or less fixed and determined by a series of syntactic and 
semantic factors that will be discussed in forthcoming papers (see Huang; Zeitoun). 
With the exception of Rukai^, all these languages pattern alike in having the semantic 
role of the NP selected as subject morphologically marked on the verb by means of an 
affix. The peculiarity of this focus' system lies in the fact that any NP, whatever its 
semantic role (e.g. agent, theme, locative, instrument, etc.), may serve as subject. We 
can roughly distinguish two kinds of constructions: in the first one, the agent is 
viewed as the focus of the clause (the A(gent) F(ocus) construction); in the second, 
any other NP can function as subject (the N(on)-A(gent) F(ocus) construction), the 
verbal affix determining its semantic role (either theme/patient, source/goal/locative 
or instrument )5. This dichotomy is illustrated in (la) and (lb) respectively. 

( 1 ) Atayal ( Wulai), Huang ( 1993: 10-1 1 ) 

a. t-m-tu? tali? qhuniq '^' 
crush-AF Tali? tree 
'The tree crushed Tali? 

b. t?-an qhuniq taH? 



most representative of its family. Atayal consists of two major dialects, Sqiiiiq and C?uliV We selected 
Wulai to illustrate the former. Bunun is made up of five dialects, Takitodo, Takibaka Takbanuafl 
Takivatan and Isbukun, the latter of which will be the object of our study. Finally, Tfoea and Budai were 

' v^^r .Qon'"^'^"^^^*''^ °^ ^"'^ languages respectively. The data on Saisiyat is based 

on Y c ^ 1 yy i v 

^ Note that Smsiyat displays SVO word order in ellicited sentences - supposedly under the influence of 
Chinese (Ye, 1991:34-36) - but VSO word order in folkstories and traditional songs 

In Rukai verbal affixes indicating the semantic role of the focus NP have been lost throueh the 
attrition of the initial auxiliary. Hence, in (i), there is no syntactic coding of the subject on the verb 

(i) Budai 

w-a-sititi -aku ki lampau 

u-Real-beat-IS.Nom Acc Lampau 
'I beat Lampau' 

With the exception of Mantauran, the Rukai dialects have developed an active/passive voice 
dichotomy not found in the other Formosan languages. Compare (i) and (ii) 

(ii) Budai 

ki-a-sititi nakuana fcu lampau 

ki-Real-beat IS.Acc Norn Lampau 

'Lampau was beaten by me' 
^. To simplify, we will purposely ignore the morphological vanations that divide these languages and 
will not explain in detail the semantic function of each of the verbal prefixes. We may say briefly that in 
AF constmctions, verbs are usually marked with -urn-, m-, 0, etc. while in NAF constructions, they may 
be suffixed with -tm (PF), -a« (PF and/or LF) or prefixed with s-, ,v/-, a,v- (IF/BF). 
^ To avoid any confusion, we use IPA symbols throughout the whole paper. The following 
abbreviations are used in the glosses: Acc: Accusative, Adv: Adverb, AF: Agent Focus, Asp: Aspect 
Ben. Benefactive, BF: Benefactive Focus. Caus: Causative, Comp: Completive aspect. Gen: GeneUve' 
IF: Iterative Focus, Itr: Iterative, Irr: Irrealis. Loc: Locative, LF: Locative Focus, NAF Non Agent 
Focus, Ncg: Negation, Norn Nominative, Obi. Oblique, P. Plural, PF: Patient Focus, Real Realis Red 
Reduplication, S: Singular. 
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crush-NAF tree Tali? 
'A tree cnished Tali?' 



They differ, however, in a number of respects: (1) some of these languages preserve a 
nominal case marking which has been lost in others (e.g. Mayrinax vs. Wulai in 
Atayal); (2) while the nominal case marking of most of these languages (e.g. Atayal, 
Rukai) is based on the nature of the referent, i.e. common nouns are marked 
differently from personal nouns and kinship terms, Tsou has developed a complex 
system in which the referent is localized in relation to the universe of discourse (see 
Zeitoun 1993)^; (3) some languages (e.g Atayal, Tsou as opposed to Rukai or Bunun) 
have a system of auxiliaries which usually occur in clause-initial position. The 
occurrence of these auxiliaries is not syntactically required for the well-formedness of 
a sentence in Atayal (Wulai) but cannot be omitted in Tsou. 

We will not elaborate ftirther on these linguistic variations. They were introduced 
to show that the Formosan languages are worth being examined for themselves and 
from a typological point of view. Having presented the geographic distribution and 
outlined briefly some of the syntactic features of the languages under study, let us 
now return to our immediate concern. The aim of this paper is to show that the 
Formosan languages, like the Philippines-type languages (see Reid 1992), have a 
complex system of verbal morphology which includes distinctions of voice (or focus), 
tense/aspect and mood. In the absence of temporal/aspectual or modal 
affixes/particles, focus affixes carry temporal/aspectual or modal information. Hence, 
in all the languages under study here, if the temporal frame is left unspecified, the AF 
m- forms (realized as -urn, -m-, m-, 0, etc.) are found to refer to situations having 
either occurred or actually taking place (realis). In co-occurrence with 
temporal/aspectual or modal affixes/particles, focus affixes retain their primary 
fv"ction: they indicate the semantic role of the NP selected as focus. In other words, 
(AF) m-forms may be found in the irrealis, but they do not carry any 
temporal/aspectual or modal information. Compare (2)-(3). 

(2) a. Atayal, Wulai (Huang 1993:41 ) 
m-qwas qutux knerin 
AF-sing one woman 
'A woman is singing/sang' 

b. Bunun (Isbukun) 

ma-baliv-ik tu tasa tu lumah 



Our analysis differs from that of Tung (1964: 147), who proposes that case markers locate a referent 
in space, the speaker being taken as the origo. In Zeitoun (1993), it is shown that (1) we must drop such 
an egocentric analysis because it fails to account for some of the data, and that (2) three parameters 
must be taken into consideration.: the speech act participants plus the time and the place of the utterance 
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AF-buy-lS.Nom one house 
'I'm buying/bought a house' 



(3) a. Atayal, Wulai 

musa? m-qwas qutux knerin 
Asp AF-sing one woman 
'A woman is going to sing' 

b. Bunun (Isbukun) 
na-ma-baliv-ik tu tasa tu lumah 
Irr- AF-buy- 1 S . Nom one house 
'I want to/will buy a house' 

In the absence of focus affixes on the verb, temporal/aspectual affixes/particles may 
carry voice distinctions. In a number of languages (e.g. Atayal, Bunun), the temporal/ 
aspectual particle -/«-, which indicates relative anteriority in Wulai (see Huang 
1993:67) or a completive aspect in Isbukun, is used in PF constructions. Compare (4) 
and (5). 

(4) Atayal (Wulai) 
p-in-aga?-maku? laqi? qani 
PF-carry-IS.Gen child this 

'I (once) carried the child on my back' 

(5) Bunun (Isbukun) 

1-in-udah-ku? takna? hay minsum-ag 
PF-beat-lS.Acc yesterday Tp come-still 
'The one beaten by me yesterday came again' 

1. The Realis/Irrealis dichotomy 

1 . 1. Against a tripartite system (Past, Present, Future) 

Li (1973:157) claims that the temporal system of Tanan (Rukai) - closely related to 
Budai - is based on a tripartite distinction of past, present and future, realized 
respectively with the affixation (to the main verb) of wa-, 0 and ay-, as shown in 
Table 1 below. 

Table 1 : Li's ( 1 973) classification of tense and aspect in Rukai (Tanan) 
Active voice 

Aspect Plain Completive Continuative 
Tense 

Pres kano kano-qa kano-kano 
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Past w-a-kana w-a-kana-ga w-a-kanakana 
Fut ay-kana ay-kana-ga ay-kana-kana 



Passive voice 

Aspect Plain Completive Continuative 
Tense 

Pres ki-kana — ki-kana-kana 

Past ki-a-kana ki-a-kana-ga ki-a-kanakana 

Fut ay-ki-kana ay-ki-kana-ga ay-ki-kana-kana 



His analysis raises a number of problems. We v^U only mention three of them. 
First, note that the 'plain' form (e.g. kana 'eat') is never used as such, hence (5) is 
ungrammatical: kano represents a root form. 

(5) Rukai (Tanan) 

* kana kuani umas sa aga 
eat that man rice 

Second, a verb prefixed with w-a- may refer to either a past or present event, 
depending on the occurrence of (a) a temporal adverb as in (7b), or (b) an aspectual 
particle as in (8). 

(7) Rukai (Tanan) 

a. v/-a-kana kuani umas sa aga 
u-Real-eat that man rice 

i. 'The man is eating the rice' 

ii. The man ate the rice' (Li 1973:158) 

b. w-a-kana kuani umas sa aga kudaa 
u-Real-eat that man rice yesterday 
'The man ate the rice yesterday' 

(8) Rukai (Tanan) 

v/-a-kana -ga kuani umas sa aga 
u-Real-eat-Comp that man rice 
'The man has eaten the rice already' 

Third, by talking about a 'future' tense - expressed by the prefixation of ay- in 
Tanan - Li makes abstraction of the modal interpretation, illustrated in (9), implied in 
this type of construction. 

(9) Rukai (Tanan) 

a. (a)y-ua -su inu ? 
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Irr-go-2S.Nom where 

'Where will you go/are you going ?' 



b. ay-tumawlay -aku kyasa 
Irr-tell tale-lS.Nom now 
'I shall tell/am going to tell a story now' 

A comparison of the above examples clearly shows that Budai establishes a basic 
modal (and not temporal) distinction between realis, where situations are viewed as 
having occurred or actually taking place, and irrealis, where events are perceived as 
having not yet happened, may possibly happen (hypothetical) or will definitely not 
happen (counterfactuals). Though the Rukai dialects differ from the other Formosan 
languages in their voice systems, all these languages pattern alike in sharing this 
dichotomy. Further examples, taken from Atayal and Bunun, are given in (10) and 
(11) respectively. 

(10) Atayal (Wulai) 

a. m-qinax tali? hira? 
AF-run Tali? yesterday 
'Tali? ran yesterday' 

b. m-qinax tali? kryax 
AF-run Tali? every day 
'Tali? runs every day' 

c. p-qinax-saku? (suxan) 

AF-run- IS.Nom (tomorrow) 
'I wall run (tomorrow)' 

(1 1) Bunun (Isbukun) 

a. ma-ludah -ku? saya? t akna? 
AF-beat-lS.Acc 3S.Nom yesterday 
'He beat me yesterday' 

b. saya? hay kawpa tu hanvan ma-ludah madaku 
SS.NomTp every day AF-beat IS.Acc 
'He beats me every day' 

c. mais sadu? saikin saitia? hai na-palinutu-an-ku? 

if see IS.Nom 3S.ACC Tp will-tell-NAF-lS.Acc 
'If I see him, I'll tell him' 

Our claim is supported by the fact that in the irrealis, AF and NAF constructions 
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are subject to a number of morpho-syntactic as well as semantic constraints although 
not all these languages exhibit the same kind of constraints. 



1.2. Morpho-syntactic and semantic constraints 

Morphologically, we note some discrepancies in the type of verbal 
affixes/auxilianes used in the realis and in the irrealis. 

In Atayal (Wulai), verbs occurring in AF constructions are usually prefixed with m- 
or infixed with -m- in the realis. They are marked with p- in the irrealis. This contrast 
is illustrated in (12a-b). 

(12) Atayal (Wulai) 

a. m-qwas-saku? ( hira?/*suxan ) 

AF-si ng- 1 S .Nom (yesterday/*tomorro w) 
'I sang (yesterday)' 

b. Huang (1993:11) 
p-qwas-saku? ( suxan/*hira? ) 
AF-sing-1 S.Nom (tomorrow/*ycsterday) 
"I will sing (tomorrow)' 

In NAF (and more specifically in PF) constructions, verbs are marked with -an in the 
realis but with -un in the irrealis**. Compare ( 13a-b). 

(13) Atayal (Wulai) 

a. phiy-an-maku? tali? (hira?/*suxan ) 
beat-NAF-lS.Gen Tali? (yesterday/*tomorrow) 
'Tali? was beaten by me (yesterday)' 

b. phiy-un-maku? tali? ( suxan/*hira? ) 
beat-NAF-lS.Gen Tali? (tomorrow/*yesterday) 
'Tali? will be beaten by me (tomorrow)' 

In Tsou, auxiliaries are marked as AF or NAF in the realis (cf mi- vs. /-; mo{h) vs. 
o{h)); they are invariable in the irrealis (cf te, (a, tena, etc.). In both cases, how«^/er, 
tiie semantic role of the NP selected as subject is marked on the verb by means of an 
affix. Compare (14)-( 15). 



In Atayal (Wulai) and Bunun (Isbukun), the instrumental/benefactive foci (.v- in Wulai and 3.v- in 
Isbukun) occur in the realis and the irrealis because the NP selected as focus only involves a peripheral 
argument. 
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(14) Tsou 

a. mo-Vu Baito to oko nehucma 

A.F-lS.Nom see Obi child yesterday 
'I saw the child yesterday' 

b. o-?u aiti ?o oko nehucma 
NAF-lS.Gen see Nom child yesterday 
'I saw the child yesterday' 

(15) Tsou 

a. te-?o Baito to oko hohucma 

will-lS.Nom see Obi child tomorrow 
'I will see the child tomorrow' 

b. te-?o aiti ?o oko hohucma 

will-lS. Gen see Nom child tomorrow 
'I will see the child tomorrow' 

Syntactically, NAF constructions may be prohibited in the irrealis. Ye (1991:71fF) 
argues quite convincingly that in Saisiyat the modal auxiliary ?am is used in co- 
occurrence with verbs marked as AF but not with those marked as NAF. A 
nominalized construction must be used instead in PF constructions to refer to the 
future. Compare the grammatical ity of ( 16a-c). 

(16) Saisiyat, Ye (1991:72) 

a. gyaw ?arn s-om-?ael ka ?alaw 
cat will eat-Af fish 
'The cat is going to eat the fish' 

b. * ?alaw ?am si-?ael-en noka? gyaw 

fish will eat-PF Gen cat 

c. Valaw ka-si?ael-en noka? tjyaw 
fish eat-PF Gen cat 

'The fish is going to be eaten by the cat' (Lit: 'The fish will be the cat's eating') 

Semantically, AF constructions are opposed to NAF constructions in the realis in 
terms of perfectivity/imperfectivity. This contrast is exemplified in (17) and (18). 

(17) Atayal (Wulai) (Huang 1993:41) 

a. m-in-qPaq-saku? ke? na? tayan 
AF-past-leam-lS.Nom word na? Atayal 

'{ learned Atayal' (1 may still be learning it; I still can't speak the language) 
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b. q-in-Paq-an-maku? ke? na? tayan 
past-leam-NAF-lS.Gen v\jrdna? Atayal 
'I ieamed Atayal' (I can speak the language) 

(18) Tsou 

a. mi-ta m-imo ta emi 
AF-3S.Nom drink Obi wine 
'He is drinking wine' 

b. i-ta ima ?e emi 
NAF-3S. Nom drink Nom wine 

"He has drunk wine' (Some is still left) 

If we accept Comrie's (1976) classification of aspectual oppositions - we reproduce 
below the table on given p. 26 - we can then easily account for the fact that (in most 
languages) AF constructions are commonly used to describe habitual or continuous 
(progressive and nonprogressive) situations^. A comparison of (19b-c) and (20a-b) 
shows that (1) in the absence of aspectual auxiliaries/particles, there is no neat 
distinction between the progressive and the non-progressive, (2) the situational 
context will determine whether the event referred to has already happened or is 
actually taking place. 

Table 2: Comrie's (1976:26) classification of aspectual oppositions 



Perfective Imperfective 

Habitual Continuous 

Nonprogressive Progressive 



(19) Atayal (Wulii) 

a. k-m-ayan -saku? squ? sunan kryax (Huang 1993:61 ) 
AF-say-lS.Nom squ? 2S.Loc every day 
'I talk to you every day' 



This does not exclude the fact that in NAF constructions the occurrence of aspectual 
auxiliaries/particles may yield a progressive reading. Compare (i) and (ii). 
(i) Tsou (ii) Tsou 

i-ta ima Ve emi i-ta nVa irna Ve emi 

NAF-3S. Gen drink Nom wine NAF-3S.Gen still drink Nom wine 

'He has drunk wine' 'He has been drinking wine' 
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b. k-m-avan -saku? squ? sunan 0 
AF-say-lS.Nom squ?2S.Loc0 

i. 'I talked/was talking to you' 

ii. 'I am talking to you' 

c. nyux -saku? k- m-ayan squ? sunan 
Asp-lS.Nom talk squ? 2S.Loc 

i. * 'I talked to you' 

ii. 'I was/am talking to you' 

(20) Tsou 

a. mo eoBako to oko ?o amo 

AF beat Obi child Nom father 

i. 'Father beat the child' 

ii. 'Father is beating the child' (Both unseen at Speech Time) 

b. mo n?a eoBako to oko ?o amo 
AF still beat Obi child Nom father 

i. * 'Father beat the child' 

ii. 'Father is (still) beating the child' (Both unseen at Speech Time) 

In the irrealis, no such distinction is found. Compare (1 8)-(21 ) and (22a-b). 

(21) Tsou 

a. te-ta m-imo ta emi 
will-3S.Nom drink Obi wine 
'He will drink wine' 

b. te-ta ima ?e emi 
will-3S.Gen drink Nom wine 
'He will drink wine' 

(22) Tsou 

a. mi-ta n?a mimo ta emi 
AF-3S.Nom n?a drink Obi wine 
'He is (still) drinking wine' 

b. te-ta n?a mimo ta emi 
will-3S.Nom n?a drink Obi wine 
'He is going to drink wine again ' 

1.3. Degrees of complexity 



So far, we have shown that all the Formosan languages (under study) exhibit a 
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basic modal (and not temporal) distinction between realis and irrealis. Below, we will 
suggest that they display various degrees of complexity. 

1.3.1. Perfectivity/imperfectivity in Rukai 

Wc have shown that in the realis, AF constructions are opposed to NAF 
constructions in terms of perfectivity/impertectivity. This is true of most, but not all, 
the Formosan languages. As mentioned in Li (1973) (cf Table 1), Rukai expresses 
these concepts through two morphological means, affixation and reduplication. In 
Budai, the affixation (to the verb) of the aspectual particles -ga on the one hand and - 

ana on the other hand are used to reflect the concepts of 'perfectivity' (or completive 
aspect) and 'imperfectivity' (or progressive aspect). The reduplication of part of the 
verb stem may either yield an habitual or a progressive interpretation, depending on 
the context (e.g. presence of different case markers in Budai). This contrast is 
illustrated in (24a-b). 

(23) Rukai (Budai) 

a. w-a-tubi-ga ka Lolai 
Real-cry-already child 
'The child has cried' 

b. w-a-tubi-ana ka Lolai 
Real-cry-still child 

'The childis crying' 

(24) Rukai (Budai) 

a. w-a-kanakang -su ka bsbbolo 
Real-eat-2S.Nom banana 
'You are eating that banana' 

b w-a-kanakan a-su ku bolobolo 
Real-eat-2S.Nom banana 
'You usually eat bananas' 

1 .3.2. Habitual/generic meaning in Tsou 

We have argued that in the realis, AF constructions are usually used to describe 
habitual or continous situations. We have shown, however, that because Rukai lacks 
the focus system commonly found in the Formosan languages, this interpretative 
variation is carried out by a morphological device, i.e. the reduplication of part of the 
verb stem. Tsou differs also in a number of respects. In this section, we will show that 
the habitual/episodic readings are taken ovei by different auxiliaries - compare the 
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use of aa- in (25) to that of mi- in (26) - while in the next section, we will suggest that 
it has grammaticalized the notion of tense. 

(25) Tsou 

a. da-ta Boni to tacimi 
ltr-3S.Nom eat Ob! banana 

i. 'He (usually) eats bananas' 

ii. * "He is eating a bana^a^ananas' 

b. da-ta huhucmasi Boni to tacimi 
Itr-3S.Nom every day eat Obi banana 
'He eats bananas every day' 

(26) Tsou 

a. mi-ta Boni ta tacimi 
AF-3S.Nom eat Obi banana 

i. 'He is eating a banana/bananas' 

ii. * 'He usually eats bananas' 

b. * mi-ta huhucmasi Boni ta tacimi 

AF-3S.Nom every day eat Obi banana 

Note that da- refers to an 'habitual present' so that reference to the past or to the future 
necessitates the use of temporal auxiliaries (e.g. mo{h) and tena) - which must occur 
in clause-initial position - as in (27a-b) respectively. 

(27) Tsou 

a. moh -ta da huhucmasi Boni to tacimi 
AF-3S.Nom Itr every day eat Obi banana 
'(In the past), he would eat bananas ever^ day' 

b. tena da-ta huhucmasi Boni to tacimi 
Irr Itr-3S.Nom every day eat Obi banana 
'(In the future), he will eat bananas every day' 

In both examples, the iterative meaning is yielded by the occurrence of huhucmasi 
'every day'. Its absence in (28a) below gives a different meaning from that of (27a) - 
the utterance must be given an episodic interpretation - while it renders the second 
ungrammatical. Compare (27a)-(28a) and (27b)-(28b) respectively. 

(28) Tsou 

a. moh-ta da Boni to tacimi 
AF-3S.Nom Asp eat Obi banana 
'He ate a bananaliananas' 
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b. * tena da-ta 6oni to tacimi 

Irr Itr-3S.Nom eat Obi banana 

13.3. Grammaticalization of tense in Tsou 

It was suggested that in the realis, AF constructions refer to present or past events 
depending on the situational context (e.g. occurrence of case markers, temporal 
adverbs or aspectual particles): as illustrated in (29) below, in Atayal (Wulai), a verb 
marked as AF may appear in co-occurrence (1) with various adverbs (e.g. hira? 

'yesterday', soni 'today', kryax 'every day') which determine the temporal frame of the 
utterance or (2) with the aspectual particle -in-. Tsou differs from the other Formosan 
languages in that it has grammaticalized the notion of (absolute) tense: only mo- but 
not mi- can co-occur with nehuctna 'yesterday'. Compare the grammaticality of (30a- 
b). In the same vein, the aspectual particle oa can co-occur with mo- (or moso, o{h)) 

but not with mi- (see (31)). A comparison of these examples clearly indicate that 
AF/NAF auxiliaries in Tsou not only carry aspectual but also temporal information as 
well. 

(29) Atayal (Huang 1993) 

a. m-in-ima? sayun tali? soni (p. 50) 
AF-past-wash Sayun Tali? today 

'Tali? washed Sayun just now" 

b. m-ulu -sami lomwa hira? (p.58) 
AF-fmd-lPE.Nom rascal yesterday 

'We found a rascal yesterday' (by accident)' 

c. k-m-avan -saku? squ? sunan kryax (p. 61) 
m-say-lS.Nom squ 2S.Loc every day 

'I talk to you every day' 

d. m-in-wah -saku? (p.9) 
AF-past-come-1 S.Nom 
'I came' 



(30) Tsou 

a. * nTi-?o 6oni to tacimi nehucma 

AF-1 S.Nom eat Obi banana yesterday 

b. mo-?u boni to tacimi nehucma 
AF- 1 S. Nom eat Obi banana yesterday 
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'I ate a banana yesterday' 

(3 1 ) a. * rni-ta da smovei ta oko 

AF-3S.Nom Asp carry Obi child 

b. moh -ta da smovei ta oko 
AF-3S.Nom Asp carry Obi child 
'(In the past), s/he carried the child' 



2. Co-occurrence restrictions 



Below, we argue that in order to understand the temporal/aspectual systems of the 
Formosan languages, (a) all the constituents of the sentence must be taken into 
consideration to account for the possible and impossible co-occurrences of auxiliaries 
with pronominal clitics (e.g. Atayal) or with case markers (e.g. Tsou, Rukai). 

2.1. The auxiliaries nyux and cyux in Atayal (Wulai) in co-occurrence with different 
pronouns 



Atayal has grammaticalized verbs of possession/location {cyux 'have') and existence 
{nyux 'exist') into aspectual auxiliaries. In that language, imperfectivity is rendered 
through the use of these two auxiliaries, which can (both) co-occur with verbs marked 
as AF or NAF. A comparison of (3I)-(32) shows, however, that they cannot permute 
freely: as mentionned in Huang (1993:71), nyux (the Proximal Imperfective) 
designates "an action taking place close to the speaker", which explains why it can 
only co-occur with first person pronominal forms (singular or plural), (see (32a-b). 
Cyux, on the other hand, indicates that the action is "taking place away from the 
speaker". In co-occurrence with a first person pronoun, the event must be interpreted 
as occurring before, not at Speech Time. Compare (32a)-(33b) and (33a-b). 

(32) a. nyux- saku ? maniq qulih 
Asp-lS.Nomeat fish 
'I am eating fish (now)' 



b. * nyux maniq qulih hiya? 
Asp eat fish 3S.Nom 

(33) a. cvux - saku? maniq qulih 

Asp-lS.Nom eat fish 
i. * 'I am eating fish (now)' 
II. '1 was eating fish' (when answering the phone)' 
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b. cyux maniq qulih hiya? 

Asp eat fish 3S.Nom 
'He is eating fish (now)' 



2.2. Auxiliaries and case markers in Tsou 

Tsou is characterized by the fact that it has developed a complex system of case 
markers divided into two classes, nominative and oblique, which do not (as in other 
Formosan languages) mark an NP according to its categorial nature, i.e. mark 
common nouns differently from personal nouns and kinship terms, but localize a 
referent in relation to the universe of discourse. Briefly, we may say that, on the basis 
of their syntactic distribution (see Zeitoun, 1993), we must distinguish between 
referential {i'e, si, ta, ?o, to) and non-referential case markers {na and no)'". 
Furthermore, among referential case markers, ?e, si, ta differ from ?o, to in terms of 
identifiability. A referent marked by ?e, si, ta is identifiable to the addressee, because 
they are directly related to the universe of discourse while a referent marked by '/o, to 
may either be identifiable but absent or invisible at Speech time but still or 
unidentifiable because of being newly introduced in the discourse. 

There are some co-occurrence restrictions between case markers and auxiliaries. 
We will illustrate below with a few examples. 
Consider first the following pairs of sentences. 

(34) a. da-ta huhucmasi bom to tactmt 
Itr-3S.Nom every day eat Obi banana 
'He eats a banana every day' 



b. * da-ta huhucmasi bom ta tactmt 
Itr-3S.Nom every day eat Obi banana 

(35) a. da-ta kaebi bent to huv?o 

Itr-3S.Nom happy eat Ob! orange 
'He likes eating oranges' 

b. * da-ta kaebt bont ta huvVo 
!tr-3S.Nom happy eat Obi orange 



Givon (1978.293) gives the following definition of referentiality: "It involves, roughly, the speaker's 
intent to 'refer to' or 'mean' a nominal expression to have non-empty reference - i.e., to 'exist' - within a 
particular universe of discourse. [. ..] If a nominal is 'non-referential' or 'generic' the speaker does not 
have a commitment to its existence within the relevant universe of discourse. Rather, in the latter case, 
the speaker is engaged in discussing the genus or its properties, b' ' does not commit him/herself to the 
existence of any specific individual number of the genus." 
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(36) a. o?a moh-ta s?a da ahtu etamaku to tamaku 

Neg AF-3S. Adv Asp never smoke Obi cigarette 
'He never smoked cigarettes' 

b. * oVa moh-ta s'?a da ahtu etamaku ta tamaku 
Neg AF-3S. Adv Asp never smoke Obi cigarette 

oa functions as an auxiliary in (34)-(35), and as an aspectual marker in (36)". Both a 
as indicate a rupture with Speech time: in the two first examples, oa refers to the 

scanning of a class of occurrences and by implication to the characteristic of the agent 
of the given sentence; in the latter, aa locates an event in the past. As a consequence, 
in each example, to but not la can co-occur with aa. 

Note also the ungrammaticality of yilh) and the semantic variation yielded by the 
substitution of to by la in (38b). How are we to account for these examples ? 

(37) a. mi-?o n?a 6oni ta tacimi 

AF- 1 S. Asp eat Obi banana 
'I'm eating a banana' 

b. * mi-?o n?a 6oni to tacimi 

AF- IS. Asp eat Obi banana 
'I'm eating a banana' 

(38) a. mi-?o cu 6om ta tacimi 

AF-IS. Aspeat Obi banana 
'I have been eating a banana' 

b. mi-'?o cu 6oni to tacimi 
AF- 1 S. Asp eat Obi banana 
'I have eaten a banana' 

In (37), nib which translates as 'still', in co-occurrence with mi-, refers to an event 
still on-going at Speech time: therefore lo cannot co-occur with n'/a. In (38), both (a 
and to can co-occur with cu 'already'. The occurrence of this aspectual marker doesn't 
enable us to determine whether the situation in question is past and completed or still 
on-going at Speech time. Such an interpretation is inferred by the presence of other 
constituents (i.e. case markers). In (38a), the use of ta indicates that the banana is still 
being eaten at '.peech time while that of to implies that it has already been eaten in 
(38b). 



See Zcitoun (1992:53-57) for the syntactic distribution of the two aas in Tsou 
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Conclusion 



In this paper, we have tried to show that all the Formosan languages under study: 

(1) exhibit a complex system of verbal morphology which includes distinctions of 
voice, tense/aspect and modality: in the absence of temporal/aspectual 
affixes/particles determining the temporal frame of the utterance, focus affixes may 
take over temporal/aspectual information and conversely, in the absence of focus 
affixes on the verb, temporal/aspectual affixes/particles may carry voice distinctions, 

(2) have a temporal/aspectual system based on a modal dichotomy between realis 
and irrealis (where AF and NAP constructions are subject to a number of morpho- 
syntactic as well as semantic constraints), 

(3) but display various degrees of complexity. 

In the course of the paper, we have also argued that in order to understand the 
temporal/aspectual and nodal systems of these languages, all the constituents of the 
sentence must be taken into consideration to account for the possible and impossible 
co-occurrences of auxiliaries with pronominal clitics (e.g. Atayal) or with case 
markers (e.g. Tsou). 

This res>=;arch represents, however, a preliminary study: it involves only five 
Formosan languages and more data should be used to test the hypotheses proposed 
here. Various problems still need to be clarified: it was shown, for instance, that in 
Atayal the focus affix -an occurs only in the realis while -un occurs only in the 
irrealis. This is not the case in Bunun where both affixes occur in the realis but still 
do not permute freely. 
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Appendix: Temporal/aspectual and modal systems of each of the language 
under 



study. 



(1) Atayai 
a. Wulai 



Realis 



AF m,-, -um-, 0 

NAF -an 



Irrealis 
P- 



-un 



BF/IF <— 
(3) Bunun (Isbukun) 

Realis 



s- — > 



AF 



NAF 



BF/IF 



m,-, -um-, 0 



Irrealis 



na- + AF or NAF forms 



-an 
-un 



<— ?is- — > 



(4) Rukai (Budai) 

Realis 



-a- 



Irrealis 

(L)i. 
nay- 



(5)Tsou (Tfuea) 



Realis Irrealis 
AF NAF Future/Hypothetical Counterfactual Habitual/Generic 

AF/NAF 



Immediate mi- i- 
mo 

Remote mo(h) o(h) 
moso 



te, tena 
ta 



nte 
nto 



da 
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1. Introduction 

1.1 Theoretical Background 

Recent work on cognitive semantics and lexicalization has drawn substantial attention to 
the correspondence between language and cognition, with a special emphasis on categorization 
(cf. Ukoff 1987; Talmy 1985, 1991; Ungacker 1987, 1991 ;Ono and Thompson 1993). It is 
argued that language is characteristic of human cognitive activities and that there are direct 
pairings of parameters of form with parameters of meaning. To fully understand language, 
one has to probe into the underlying conceptual organization of language. As one central 
aspect of cognition, categorization in general bears gestalt properties and an ecological 
structure. It is neither objectivistic nor atomistic as traditionally thought. Similarly, linguistic 
categories should be viewed as schematic in nature and radially structured, rather than with 
discrete boundaries (Lakoff 1987). 

The notion of radial structure as defined in Lakoff (1987) provides a new perspective to 
the study of Mandarin Verb-Resultative (V-R) compounds (eg. fj^, ^H, , 

etc.). As a grammatical category, V-R compounds display a wide range of morphological, 
syntactic and semantic variations, and prcve to be a productive resource for new complex 
verbs. Early studies on Mandarin V-R compounds focu- mainly on their internal 
morphological and semantic make-ups (cf. Chao 1968; Thompson 1973; Li and Thompson 
1981). More recently, researchers have worked intensively on the argument structure and 
semantic restrictions associated with V-R compounds. Chang (1988, 1991) emphasizes the 
importance of thematic roles in relation to the grammatical properties of V-R's. Lin (1990) 
adopts a semantic approach, deriving collocational and subcaiegorizational regularities of V- 
R's on the basis of a semantic classification of individual verbs. Within the framework of GB 
theory, Li (1990, 1993) proposes a structure-based account, which takes the first V as the head. 
Following the theory of LFG, Huang and Lin (1992) demonstrates a morpholexical approach 
by postulating argument templates and selection rules to match the argument structure with the 
thematic structure. Other studies focus on one specific subcategory of V-R's with a detailed 
characterization of its syntactic or semantic properties (cf. Chang 1993, Gao 1993, Yeh 1993). 

1 .2 The Question 

In their attempts to account for the diverse behavior of V-R compounds, previous 
studies all operate under the assumption that V-R compounds as a grammatical 'category consist 
of a fixed set of "classes', whose behavior can be rigidly formulated and perhaps predicted in 
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terms of the syntactic and semantic properties of the verbal elements. However, given the 
wide rang of diversity and the huge number of V-R's, a more basic question needs to be asked: 
what makes it possible that all these different event complexes with distinct semantic 
combinations share the same surface form, i.e., all being coded as a V-R compound. 
More specifically, what motivates the various subtypes of V-R compound and how are 
they interrelated and perceived as one unique grammatical category? 

1.3 Scope and Goal ot the Paper 

As an attempt to answer the above questions, this paper follows the proposal in Talmy 
(1991) that certain types of event complex are universally amenable to conceptualization as a 
single fused event and, accordingly, to expression by a single clause. Mandarin V-R 
compounds manifest exactly such a conceptual conflation of events (a 'macro-event' in 
Talmy "s terms), which may be established as a component of cognitive-linguistic organization. 
To account for the internal structure of the V R category, various image-schematic models are 
proposed as the conceptual bases for categorial extension. This paper further characterizes the 
cognitive principles that motivate these subtypes, and specifies the conceptual links between 
them. It ultimately shows how the category of V-R compounds can be reconstructed as a 
complex, 'radial' category, with non-discrete boundaries, rather than one whose members can 
be described in terms of a set of shared properties (cf. Lakoff 1987). 

1. 4 The Data 

The analysis of this paper is based on entries of V-R compounds collected in a large 
electronic dictionary of Mandarin, which has been developed by the CKIP group at the 
Institute of Information Science, Academia Sinica. The total number of V-R's in this 
dictionary is about 2430, but not all of them are found in the CKIP corpusi of written 
Mandarin. Below are some V-R examples that occur most frequently in the corpus: 



2. Event Structure and Typological Account 

2.1 Event conflation as a universal cognitive-linguistic process 

The prototypical function of the V-R compound taken as a unit is to report a complex 
event (cf. Hopper and Thompson 1984:736). This complex event consists of two subevents - 
an initial activity (V) and a resultative stage (R), but it is expressed only with a single clause. 
This kind of event structuring is examined in great detail in Talmy (1991). According to 
Talmy, although a complex event is usually partitioned into a main event and a subordinate 

' The CKIP corpus of written Mandarin cnnisits mainly nf newspaper and journal articles, with 
approximaly 20 million characters. 



(1) 



a. v-g!j: mm &\ 

b. V-Ending State: WM W-^ 

c. V-ai/m5l5: mi^ Mt^ 
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event, together with the relation between them, 'there appears to be a general cognitive process 
at work in language whereby an event that under a more analytic conceptualization would be 
understood as complex and represented by a multi-clause syntactic structure can be 
alternatively conceptualized as siniplex and represented by a single clause' (1991:481). This 
process is termed 'conceptual conflation of events', and the conflated event complex is called a 
'macro-event'. 

Within the macro-event, there are two event-components: a main, framing event and a 
subordinate, supporting event. The framing event provides the overarching conceptual 
framework or reference frame for the whole macro-event, and it serves to delineate a certain 
type of schematic structure in a particular set of organized conceptual domains, a function 
called 'domain-schematizing'. Talmy identifies five types of domain schematization that the 
framing event can represent: an event of motion, an event of contouring in time, an event of 
state change, an event of correlation among actions, and an event of fulfillment in action 
realization. The supporting event performs a function of support in relation to the framing 
event, which can be further specified as Cause, Manner, or Purpose, etc. Talmy further 
suggests that the framing event and the macro-event, both representing a conceptual unit 
mapped with a linguistic unit, should be recognized as two components of cognitive-linguistic 
organization (Talmy 1991: 481-87). 

2.2 Typology of Event conflation 

An important claim, in Talmy (1991) is that the existence of the macro-event as a 
cognitive unit and its specific conceptual structuring may be universals of linguistic 
organization. There are two typological accounts concerning the coding of macro-event (1991- 
486-87): 

First, languages can generally be divided into a two-category typology on the basis of 
the characteristic pattern in which the conceptual structure of the macro-event is mapped 
onto syntactic structure. That is, the core schema of the macro-event may be expressed 
either by the main verb or by the satellite2 (or adjunct). English is mentioned as an 
example of satellite- framed language, since it is the particle 'in' in sentences like 'The ball 
rolled in' that expresses the schematic core. Spanish, on the other hand, exemplifies 
a verb-framed language. 

Secondly, the syntactic site - verb or satellite - where Path is characteristically 
expressed is also to a great extent where aspect, state change, action correlation and 
realization are characteristically expressed. 

As will be clear in the following section, these two typological accounts may both 



2 A satelite is defined by Talmy (1991:486) as ' the giammatical category of any constilucnt 
other than a nominal complement that is in a sister relation to the verb root. It can be 
either a bound morpheme or a free word, such as English verb particles. Chinese verb 
complement. German verb prefixes, etc. 
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apply to Mandarin V-R's, showing that the V-R category in Mandarin is not an isolated 
phenomenon. It complies with the universal, cognitive-linguistic process that maps conceptual 
conflation of events with a single syntactic unit. However, a more elaborated account is still 
needed to account for the language-specific structure of the Mandarin V-R's and the 
interrelationship among its subcategories. 

2.2. Domain schematization with Mandarin V-R compounds 

The V-R compound in Mandarin illustrates exactly a macro-event as defined above, 
conceptually conflating a complex event into a simplex one and linguistically coding it with a 
single clause. Following Talmy's typology. Mandarin can be categorized as a satellite-framed 
language: it maps the core schema of a V-R macro-event into the satellite, i.e., the verb 
complement (R). And it is tiue, as Talmy observes, that the satellite may be used to specify 
five types of domain schematization, including path, aspect, state change, action correlations 
and realization. Below are illustrations of each type of domain schematization: 



(2) a. Motion-Path: 

b. Temporal contouring- Aspect: 

c. State change-Changed property: 

d. Action correlation-Correlation: 

e. Action realization-Fulfillment: 



Examples in (2) help to provide some basic ideas regarding the semantic domains commonly 
encoded by V-R's. Nevertheless, the categorical diversity of V-R's is much more 
complicated than (2) may show. For example, the same verb complement may be used with 
either motional or non-motional activities, resulting in some kind of semantic parallelism 
between the two domains: 



(3) Motion: 



a. mm 

b. mtarMM or s) 

c. m±m^ or -s) 

d. mm 

e. m§. 

f. a^mm 



Non-motion a. ^HgiJ 
b. 

c. S-h 

d. mm 

e. 



3 Examples of VR compounds lluit specify action correlation arc oxlrcmcly limited in Mandarin. 
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Given the categorical complexity of V-R's, what needs to be further explored is the 
interrelationships among different semantic domains and the motivation for clustering them 
into the same syntactic category. Beyond merely listing possible semantic subtypes of V-R's, 
the following sections will examine in detail the relationships among the different 
subcategories of V-Rs, and identify cognitive principles that allow the Mandarin V-R 
compound to encompass such a wide range of semantic variations. 



3. V-R Compound as a Radial Category 
3.1 Radial structure 



Lakoff (1987) has made it clear that the grammar of a language is a cognitive 
subsystem. It is dependent on many other aspects of cognition, such as prototypes, cognitive 
models, mental spaces, etc. It is shown that similar to conceptual categories such as the notion 
'mother', linguistic categories in general exhibit categorial structure of a radial type. That is, 
there is a central subcategory and non-central extensions on it. The central case provides the 
best illustration of the category, and the non-central cases are derived by convention as 
variations on the ideal case. The possibilities for extension are by no means random, since 
they are determined or more accurately, motivated by the central model plus certain general 
cognitive principles. 

According to Lakoff (1987: 68, 113-14), there are four types of structuring principles 
that give rise to radial categorization: propositional structure, image-schematic structure, 
metaphorical mappings, and metonymic mappings. These four cognitive principles are 
essential to the characterization of the overall category structure, specifying the central 
members and links between the central and non-central cases. 



The concept of radial structure, as defined above, may also be applied to the categorial 
characterization of Mandarin V-R compounds. As will be shown below. Mandarin V-R's 
consist of a central subcategory and a number of extensions, whose interrelationships may be 
accounted for with image-schematic models and metaphorical/metonymic mappings. 



3.2 Path-Schemas and Metaphorical Links 



It is well established in the literature that bodily experience with the spatial domain 
proves to be most basic in human cognition. It provides the foundation for conceptualization 
of many other semantic domains (Lakoff and Johnson 1980). As mentioned previously, Talmy 
(1991, 1987) also takes the event of motion with a specified Path or Location to be the basic 
type of event conflation. More relevantly, Goldberg (1992) proposes that the constructional 
scheme describing caused motion is mapped unto the expression of resultatives. According to 
Goldberg's analysis of English constructions, the resultative construction in English, which 
marks a 'change of state', is itself a metaphorical extension of the caused-motion construction, 
which marks a 'change of location'. In other words, the semantic pattern 'X causes Y to move 
to Z' is utilized to express "X causes Y to become Z", as illustrated by the two sentences in 
(4): 



er|c 
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(4) a. Caused- motion: Joe kicked the bottle into the yard, 
b. Resultative: Joe kicked Bob black and blue. 



Goldberg's analysis points out one important fact that the way 'change of state' is expressed is 
usually^modeled upon the way locational change is expressed. The Mandarin coding system 
provides even stronger evidence, since in Mandarin, both caused-motion awd resultative can be 
expressed as V-R compounds: 

(5) a. aiE^ 'MU. 'I kicked the ball into the hole.' 

b. |5c?Eftfe WiM T . 'I kicked him and (as a result) he was hurt.' 

The domain of spatial/Locational motion (Motion-Path) provides the basic cognitive 
model for describing other resultative activities, be it physical or non-physical, which is made 
possible via the metaphor ACTIVITY as MOTION, or ACTIVITY as JOURNEY (Ukoff and 
Johnson 1980). The metaphorical transfer highlights the notion 'Path' in motion and the 
corresponding feature 'state change' in most resutative activities. The notion Path may give 
rise to various image-schemas, which are in turn utilized to conceptualize distinct activity 
patterns in non-spatial domains. In (6) below, some of the most common Path-schemas are 
represented, and examples of V-R's are listed to show that the particular Path-schema is 
mapped from the spatial/locational domain (SP) to the domain of physical state (PH) and to the 
domain of mental or perceptual state (M/P): 

(fi) Semantic Domain 



Path-Schema Visual Image SP PH M/P 

Schema 1. Path with an Endpoint ^ SJfM #10 5ra 4' 

^] 

Schema 2. Path with Direction \!^ ^^'^ ^^'^ 

Schema 3. Path through/across a landmark ^ '"^^ MM \WM isCi^i)M 

Schema 4. Path over/under a landmark | |_| #± ^-.l: 

Schema 5. Path into/out of a container - j gfeii Pill I^M 

Schema 6. Path on a surface vl^ - 
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(6) shows that each Path-schema underlines a particular case of spatial motion, 
exemplified by the V-R compound under the column SP, and the particular Path-schema may 
be mapped unto the domain of physical state, evidenced by the example under PH, and then 
unto the mental/perceptual state, evidenced by the example under M/P. Therefore, for 
example, along Schema 5 (Path In/out of a containe:), we have -> ^jg -> fgjg. 
Along each individual Path-schema, the metaphorical links among the different semantic 
domains can be specified as follows: 

(7) Change of location -> Change of physical state -> Change of mental or 
perceptual state 

In the following, a detailed specification of each Path-schema will be given, together with a 
discussions of possible extensions. 

3. 3 Schematic Properties of Path-Schema and Possible Extensions 

3.3.1 Schema 1 (Path-Endpoint) 

Among the proposed image-schemas in (6), Schema 1 (Path-Endpoint) is considered to 
be central, since it provides the most general case of a Path. The centrality of Schema 1 can 
also be seen from the frequent occurrence of this type of V-R's in the CKIP corpus, as listed 
above in (la), and the prototypical V-R's, as exemplified in (lb). 

(1) a, v-fij: ^jfjij ^gij tt/jnj:j ^mm mm mmm 

b. v-Ending State fjif nm Mm #p \m MM 

The major schematic properties in Schema 1 include: a generic trajectory of Path and 
the Endpoint of that Path. In an event of spatial motion, the endpoint is the final location the 
moving entity (be it Agent or Theme) ends up with. This schematic pattern corresponds well 
with the basic semantics of resultatives, namely, an activity gives rise lo a state/property 
(normally with regard to the Patient), as a direct result of the activity. The resultative state (R) 
is conceptualized as signaling the ending point/location of the activity-path. 

The other non-central subcategories of V-R's can all be taken as variations or 
extensions of the central case. They are motivated either by a different Path-schemas or a 
metaphorical/metonymic link. 

3.3.2 Schema 2 (Path-Direction) 

The second Path-schema highlights a directional motion. The Path is directed with a 
specific orientation. This feature of spatial orientation may be utilized to describe activities in 
other domains which can be conceptualized as going upwards or downwards, such as 'filjT^i" Y 

- '?'li!)'7^''5|<:' . To illustrate a further categorial extension based on Schema 2, please 
consider the followini; examples: 
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(8) a. v-m^: mm^M m^x^M m^mm 



Examples in (8a) and (8b) both express Aspect with regard to temporal contouring of a non- 
spatial event: '^E"^' in (8a) adds an inchoative aspect to the action; 'TS' in (8b) signals 
continuation of the activity. Both subcategories exemplified in (8) are built upon Schema 2 
(Path-Direction), with a metaphorical extension from the spatial domain to the temporal 
domain. The spatial direction 'up' is utilized to express a corresponding feature in temporal 
contouring, namely, the very start of an event. The spatial direction 'down' is utilized to 
express the 'carrying-on' of an event. 

Although the transfer from spatial to temporal domain is common and extensive cross- 
linguistically. In Mandarin, only a limited set of path-schemas are utilized to describe temporal 
contouring. This fact indicates one important aspect of radial structure. That is, not all 
possible variations of the central case exit as subcategories. The subcategories are derived by 
culturally-specific conventions and have to be learned. 

Another interesting observation associated with schema 2 is that the derived temporal 
subcategory with the complement -'|E^' is further utilized, via a metonymic transfer, to 
specify the temporal or conditional frame for a descriptive proposition. Consider the 
following examples: 

(9) a. -m-m^. 
b. MsmmMmK-mmm. 

The instances of -'|E5|5' in (9) are usually analyzed as an evaluative use in previous studies (cf. 
Chang 1993, Gao 1993, Yeh 1993). However, the 'evaluative' interpretation actually arises 
from the descriptive statement following the '^5j5' -clause, not from the very use of -',$5515' . 
The unit V- '^B5l5' should be treated as separate fiom the subsequent clause, and its function 
should be analyzed in terms of its relation to the subsequent clause. In my view, the two uses 
of V-'$E,5|5' in (9) should be glossed as 'when he walks,...' and 'when she plays bails,...'. 
Thus, V-'^E5j5' in (9) signals a temporal frame for the subsequent proposition under realis 
mode, or signals a conditional frame under irrealis mode. 

From inchoative to temporal or conditional frame, the directional feature -'^E5l5' 
undergoes a metonymic transfer in that the starting of an activity is used to represent the 
occurrence or existence of that activity, which in turn provides the temporal/conditional frame 
in relation to another proposition. 

3.3.3 Schema 3 (Path Through/ Across a Landmark) 

The third Path-schema contains two salient features: a Reference-Landmark and a 
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trajectory across the Reference-Landmark. Such a path-pattern, when applied to other 
domains, suggests a concept like 'overcome an obstacle', as exemplified in '(^ji^' , ' MjI^C# 

i^y • 

Moreover, the trajectory feature encoded in Schema 3 also provides the conceptual 
basis for describing Experiential aspect, as in 'ni^ji^' : the spatial motion of going through or 
across an object is compared to the temporal experiencing of an activity. 

3.3.4 Schema 4 (Path Over/Under a Landmark) 

The major features in Schema 4 include: a Reference-Landmark and a Trajectory that 
goes to the top or to the bottom of the Landmark. 'To be on top of something' suggests the 
concept 'achievement' or 'higher status' in non-spatial domains, as evidenced in '^_h' , ' @5 
(t#)-h' . On the other hand, the opposite path pattern 'to be under or lower than something' 
suggests 'inferiority' or 'lower status', as in '^jg,' . 

Although the complement '±,' usually describes the concept 'on top', its seeming 
lexical opposite '"f-"' is not frequently used in V-R's to express the opposite concept. Instead, 
the complement ''Y' encodes primarily a downward movement associated with Schema 2. 

3.3.5 Schema 5 (Path Into/Out of a Container) 

Schema 5 highlights the concept of a container and a trajectory into or out of it. This 
schema is commonly seen in languages and traditionally called the Container metaphor. As 
the notion of 'container' may apply almost to any physical object that has a boundary, and 
even to abstract entity such as the mind, this schema is extremely productive in the coining of 
V-R compounds. 

fn terms of lexical choice, the complements 'i§' or 'A' are typically used to express 
the notion 'into', such as ' MMi^MT , ' MM (Hi^)' • On the other hand, the motion of 
coming out of a container may be extended to describe the notion of producing something, as a 
product coming out of a given activity-container. Examples of this sort are: ''Mit\' , ' 'MUS . 

' mmn^,^' ' mmti'>^M' , ' Mm^j?' . etc. 

3.3.5 Schema 6 (Path on a Surface) 

This schema also underlines a common metaphor Path as Surface (Lakoff and Johnson 
1980), on the basis that the totality of all the points covered by a path can be viewed as filling 
up a surface. The spatial feature of covering a surface along a path may be transferred to 
other domains to describe the thoroughness of an activity, as in 'IJcM' . ' ^'M' . 



The suffixes ' or ' are deictic markers that functions primarily to indicate the 
point of view of tiic speaker. Por an intensive and detailed study of these two morphemes, 
see Wei (1994). 
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The above characterization of the Path-schemas and their extensions is aimed to provide 
a fundamental account of the internal categorial structure of the Mandarin V-R compound. 
Although there may be some extensions and V-R subcategories that are not addressed, the 
above discussion should be sufficient to establish the fact that the V-R category is well- 
motivated and radially structured. 



4. Conclusion 

As an attempt to justify various semantically-diverse cases of V-R compound as 
belonging to the same grammatical category, this paper undertakes the task to explain the 
interrelationships among and the motivations behind major subtypes of V-R compounds. It is 
proposed that Mandarin V-R compounds exhibit properties of a radial category by having a 
central subcategory, established along the Path-Endpoint schema, and non-central 
subcategories, established on variations of the central schema plus metaphorical or metonymic 
extensions. 

This work is essential to the understanding of Mandarin V-R compound as a cognitive- 
linguistic component that complies with universal observations on the conceptualization and 
coding of complex events. 
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